This talk will give an overview of Apache Nutch, its main components, how it fits with other Apache projects and its latest developments.
This talk will give an overview of Apache Nutch. I will describe its main components and how it fits with other Apache projects such as Hadoop, Lucene, SOLR, Tika or HBase.
The second part of the presentation will be focused on the latest developments in Nutch and the changed introduces by the brand new version 2.0.