AI Dose

[HN] Show HN: Trawl – Scrape any site with natural language fields, not CSS selectors

Impact: 8/10

Summary

Trawl is a new web scraping tool aimed at a common failure mode: scrapers that break when a website is redesigned. Instead of relying on fragile CSS selectors, the user names the fields they want in natural language, such as "title" or "price", and Trawl uses an LLM to locate the matching data on the page. The goal is scraping that keeps working when a site's structure changes.
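
To make the idea concrete, here is a minimal Python sketch of field-based extraction. It is not Trawl's actual API or implementation, which the summary does not describe; the names `build_prompt`, `strip_tags`, `extract`, and `fake_llm` are hypothetical, and the model call is passed in as a plain function so the example runs offline with a stub in place of a real LLM.

```python
import json
import re
from typing import Callable, Dict, Optional

# Hypothetical field spec: field name -> natural-language description.
FIELDS = {
    "title": "the name of the product",
    "price": "the current price, including the currency symbol",
}

def strip_tags(html: str) -> str:
    """Reduce HTML to visible text so the prompt does not depend on markup."""
    text = re.sub(r"<(script|style)\b.*?</\1>", " ", html, flags=re.S | re.I)
    text = re.sub(r"<[^>]+>", " ", text)
    return re.sub(r"\s+", " ", text).strip()

def build_prompt(fields: Dict[str, str], page_text: str) -> str:
    """Describe the wanted fields in plain language and append the page text."""
    wanted = "\n".join(f'- "{name}": {desc}' for name, desc in fields.items())
    return (
        "Extract these fields from the page text. Reply with one JSON object "
        "and use null for anything that is missing.\n"
        f"{wanted}\n\nPage text:\n{page_text}"
    )

def extract(html: str, fields: Dict[str, str],
            llm: Callable[[str], str]) -> Dict[str, Optional[str]]:
    """Ask the model for the fields and keep only the keys that were requested."""
    reply = llm(build_prompt(fields, strip_tags(html)))
    data = json.loads(reply)
    return {name: data.get(name) for name in fields}

def fake_llm(prompt: str) -> str:
    """Stub standing in for a real model call (assumption: no network access).

    It treats the first dollar amount as the price and the text before it
    as the title, which is enough to exercise the pipeline.
    """
    page = prompt.split("Page text:\n", 1)[1]
    match = re.search(r"\$\d+(?:\.\d+)?", page)
    if match is None:
        return json.dumps({"title": page or None, "price": None})
    return json.dumps({"title": page[:match.start()].strip(),
                       "price": match.group(0)})

# The same product before and after a hypothetical redesign: every tag and
# class name changes, so a selector such as "span.price" would stop matching.
old_html = ('<div class="product"><h1 class="title">Blue Widget</h1>'
            '<span class="price">$19.99</span></div>')
new_html = '<section><h2 data-x="name">Blue Widget</h2><p><b>$19.99</b></p></section>'

print(extract(old_html, FIELDS, fake_llm))
print(extract(new_html, FIELDS, fake_llm))
```

Both calls go through the same field descriptions, and nothing in `extract` refers to a tag or class name. With a real model in place of `fake_llm`, the trade-off is the cost and latency of a model call per page, plus the need to validate the returned JSON.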
