A leading US-based information publisher that provides data-driven insights to its customers needed a database refresh service.
The publisher's database covered more than 250 data points, including company-related information such as employee data, services, products, and affiliates, all of which needed validation.
Aggregating and validating millions of records and managing such high-volume processes in-house was difficult. The work called for an automated approach, and the publisher chose RefreshData360 to help aggregate, refresh, and validate its database.
The volume of data to be refreshed was huge.
The data had to be refreshed from sources spread across many different websites, and these sources often lacked adequate information.
Validating the information required extensive research across multiple domains.
Additionally, the extracted data was prone to errors, which made data validation necessary.
These processes could not be handled manually; they required appropriate tools and platforms for data refreshing, deduplication, and validation.
The data stewards at X-tract.io analyzed the challenges and implemented a step-by-step solution using two robust, scalable in-house platforms: Worxtream and Mojo.
It was challenging to monitor the data across numerous websites and track the changes.
It was challenging to validate huge volumes of data and ensure accuracy.
It was challenging for the client to access the very large volume of refreshed data.
The overall spend was reduced by $300k per year
The workforce was reduced by 30%
The per-record processing time was reduced from 35 minutes to 24 minutes
Automation reduced the overall manual verification effort by 15%
Formatting errors were eliminated through intensive quality checks