Quick Start
Scrape any URL in one line:Scraping Methods
Notte provides two ways to scrape:| Method | Use Case |
|---|---|
client.scrape(url) | Quick, one-off scrapes |
session.scrape() | Scraping after navigation or authentication |
Quick Scrape
For simple scraping without session management:Session-Based Scrape
For scraping after authentication or navigation:Structured Extraction
Extract data into typed Python objects using Pydantic models. The extraction is powered by an LLM that understands the page content and extracts the specified fields.Using Pydantic Models
Define a schema and extract matching data:Using Instructions Only
For flexible extraction without a strict schema:Extracting Lists
Extract multiple items from a page:Nested Structures
Handle complex, nested data:Image Extraction
Extract all images from a page:Configuration Options
Content Filtering
Control what content gets extracted:Links and Images
Control link and image extraction:Scoped Scraping
Scrape only a specific section of the page:Link Placeholders
Reduce output size by using placeholders:Return Types
The scrape method returns different types based on parameters:| Parameters | Return Type |
|---|---|
| None | str (markdown) |
instructions | StructuredData[BaseModel] |
response_format | StructuredData[YourModel] |
only_images=True | list[ImageData] |

