Need Nutch experts to help with Apache Nutch, a highly extensible and scalable open source web crawler software project? Hire Experfy freelance Nutch experts capable of configuring Nutch to crawl in local mode and post to Apache Solr for search/index, setup and run Nutch using Cassandra as storage, setup and run Nutch in Hadoop pseudo-distributed mode. They can also configure, build, crawl and debug Nutch within Eclipse, run Apache Nutch on Elastic MapReduce, and enable Nutch to authenticate itself using NTLM, Basic or Digest authentication schemes. Further, our Nutch experts know how to use the ParseChecker tool to quickly scrape a website, re-crawl with Nutch, map Nutch Hbase table to Hive. They are also familiar with optimizing crawling/fetching speed with Nutch, and how to add desirable options to your Nutch intranet crawling configuration.
Hire Experfy vetted freelance Nutch experts capable of using Nutch with Cloudsearch, including pseudo-distributed mode, and getting a nice UI on top of your Nutch crawl data.
PhD Candidate, Blue Brain Project, Neuroscience at EPFL
CH | Vevey, VD
Experfy is doing something groundbreaking - it is assembling some of the most prestigious talent in big data, analytics and engineering space. Our deep candidate pool is built through rigorous screening so you only hire the very best!
"Today's hottest companies are all data-driven. The Experfy team has developed an ecosystem that allows business and highly qualified data scientists to connect and develop powerful algorithms that can deliver 10x or 100x performance and growth. Watch this company closely."