10

8

6

4

2


9.1

8.7

8.7

6.4

8.1

7.7

4.9

8.5

3.2

7.8

5 Web Crawling libraries and projects

  • JSoup

    9.1 8.7 L2 Java
    Scrapes, parses, manipulates and cleans HTML.
  • Crawler4j

    8.7 6.4 L2 Java
    Simple and lightweight web crawler.
  • Apache Nutch

    8.1 7.7 L2 Java
    Highly extensible, highly scalable web crawler for production environment.
  • storm-crawler

    4.9 8.5 Java
    Web crawler SDK based on Apache Storm
  • Sparkler

    3.2 7.8 Java
    Spark-Crawler : Evolving Apache Nutch to run on Spark.

Add another 'Web Crawling' Library