Dahu Search - more from the web

Passion. Innovation. Experience. Search.

Dahu data extraction from the web

Dahu combines a passion for innovation with decades of experience in building effective search to help you find real value from the web.

High-value, rich data really can be mined from the low-value, long tail of the web, but traditional methods prove far too expensive to distil it in a scalable and cost-effective way. Dahu are combining the best existing approaches for web mining with unique cutting-edge research to build tools that extract order and insight from today's web at a fraction of the cost of existing services. The Dahu Enriched Data Granularity Engine (EDGE) encompasses intelligent web crawling, data disambiguation, automated meaning-based extraction, content enrichment and scalable data analysis to build intelligent XML data feeds. More…

Our consultants have decades of experience in a wide range of search technologies, from high-performance engines like Exalead's CloudView and MarkLogic through to commodity open-source platforms like Apache lucene/Solr and ElasticSearch. Now with the power of EDGE, we are helping our customers to build a new generation of online applications powered by intelligent data harvested from the web, with high scalability and low total cost of ownership.

 

Dahu search gets data value from web pages