3 Web Crawling libraries and projects

  • JSoup

    9.0 7.3 F Java
    Scrapes, parses, manipulates and cleans HTML.
  • Crawler4j

    8.5 6.7 F Java
    Simple and lightweight web crawler.
  • Apache Nutch

    8.0 6.8 F Java
    Highly extensible, highly scalable web crawler for production environments.
