A web content extraction library that strips clutter such as ads, sidebars, and navigation from web pages, leaving only the main content. It returns the cleaned content as HTML or Markdown, along with extracted page metadata.
Developer Tools · Open Source · Libraries · ★ 6.1k · MIT · built by @kepano · 1mo ago