--- layout: default title: "ContentScrapingStrategy" parent: "Crawl4AI" nav_order: 4 --- # Chapter 4: Cleaning Up the Mess - ContentScrapingStrategy In [Chapter 3: Giving Instructions - CrawlerRunConfig](03_crawlerrunconfig.md), we learned how to give specific instructions to our `AsyncWebCrawler` using `CrawlerRunConfig`. This included telling it *how* to fetch the page and potentially take screenshots or PDFs. Now, imagine the crawler has successfully fetched the raw HTML content of a webpage. What's next? Raw HTML is often messy! It contains not just the main article or product description you might care about, but also: * Navigation menus * Advertisements * Headers and footers * Hidden code like JavaScript (`