Genetic programming approach to record deduplication
Our Price
₹2,500.00
10000 in stock
Support
Ready to Ship
Description
Several system relay on consistent data such as digital libraries may be affected by existence of duplicates .Several deduplication strategies are available but they relay on manually chosen settings to combine evidence used to identify the records as being replicas.Genetic Programming approach was used to record deduplication . This method effectively identifies replicas in the repositories using genetic operations.Because of that we can provide consistent data to offer high quality services and potential savings in computational time. Genetic programming approach used to provide consistent data. This program automatically selects fitness functions using this function duplicate records are eliminated. our approach outperforms an existing state-of-the-art method found in the literature. Moreover, the suggested functions are computationally less demanding since they use fewer evidence. In addition, our genetic programming approach is capable of automatically adapting these functions to a given fixed replica identification boundary, freeing the user from the burden of having to choose and tune this parameter.
Tags: 2012, Data Mining Projects, Java