The Scrapelect Book

Welcome to the scrapelect book. scrapelect is a declarative web-scraping language, where you describe how to find data on a web page and how to filter and process that data, then get output in a structured, machine-readable format.

scrapelect is currently in development, and the language and interpreter are changing. This book aims to be up to date with the latest released version (currently v0.3.2). If something is inconsistent or incorrect, please consider submitting an issue or pull request to help improve the documentation.

Helpful links:

  • GitHub repository: contains the source code for scrapelect (and this book).
  • docs.rs: lists developer documentation for contributing to or extending the scrapelect interpreter, as well as user documentation for scrapelect's built-in filters.
  • GitHub issue tracker: the place to search and file issues to report bugs, request features, and ask questions.