10

8

6

4

2


8.8

6.7

6.8

6.8

5.5

8.5

5.2

8.0

4.3

5.7

5 Web Crawling libraries and projects

  • Crawler4j

    8.8 6.7 L2 Java
    Simple and lightweight web crawler.
  • JSoup

    6.8 6.8 L2 Java
    Scrapes, parses, manipulates and cleans HTML.
  • storm-crawler

    5.5 8.5 Java
    Web crawler SDK based on Apache Storm
  • Apache Nutch

    5.2 8.0 L2 Java
    Highly extensible, highly scalable web crawler for production environment.
  • Sparkler

    4.3 5.7 Java
    Spark-Crawler : Evolving Apache Nutch to run on Spark.

Add another 'Web Crawling' Library