If you have a decent e-book collection, could you please try it out and tell me what you think?
https://www.zabkat.com/test/ideclone_setup_beta.exe
You must also download the e-book filter and preview pack from here:
http://zabkat.com/blog/mobi-epub-xps-pa ... search.htm
Thanks for your feedback.
A new algorithm detects similar books by CONTENT, which is a must since e-book tags such as author are unreliable. It can find books in your collection that exist in multiple formats, e.g. both EPUB and PDF or AZW. The scan is a bit slow but quite accurate.
Just select e-Books as the scan category, add the folder(s) containing your book collection, tick "Find similar" and let it find the books you have in multiple copies. Use the preview pane to browse the discovered books, or select TWO books in a group and press ALT+2 (COMPARE ITEMS context menu command) to compare them for plain text differences. Then apply the usual Mark and Cleanup operations to get rid of the unwanted duplicates.
Note that only the TEXT portion of each book is examined. Any differences in embedded images (or formatting) aren't taken into account. Furthermore, only the first few KB of each book are scanned, not the entire book, so two books may differ in later sections that are never read. However, since books are first matched on the CharacterCount property, the probability of such mistakes is minimized. False positives are still possible, of course!
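
For anyone curious how this kind of matching works in principle, here is a rough Python sketch of the idea described above: match on character count first, then compare only the first few KB of plain text. This is NOT the program's actual code. The function names, the 4096-character prefix, the 2% count tolerance and the 0.9 similarity threshold are all made up for illustration, difflib is just a convenient stand-in for the real comparison, and the plain text is assumed to have been extracted from each EPUB/PDF/AZW already.

```python
from difflib import SequenceMatcher
from itertools import combinations

# All three values are illustrative assumptions, not the tool's real settings.
PREFIX_CHARS = 4096      # "first few KB" of text to compare
COUNT_TOLERANCE = 0.02   # allowed relative difference in character count
MIN_RATIO = 0.9          # similarity needed to call two books duplicates


def normalize(text):
    """Lowercase and collapse whitespace so formatting differences are ignored."""
    return " ".join(text.lower().split())


def similar_books(books):
    """books maps a file name to its extracted plain text.

    Returns a list of (name_a, name_b, ratio) for pairs that look alike.
    """
    prepared = {name: normalize(text) for name, text in books.items()}
    matches = []
    for a, b in combinations(sorted(prepared), 2):
        ta, tb = prepared[a], prepared[b]
        longest = max(len(ta), len(tb))
        if longest == 0:
            continue
        # Cheap pre-filter: character counts must be close.
        if abs(len(ta) - len(tb)) > COUNT_TOLERANCE * longest:
            continue
        # Expensive step: compare only the leading part of the text.
        ratio = SequenceMatcher(
            None, ta[:PREFIX_CHARS], tb[:PREFIX_CHARS], autojunk=False
        ).ratio()
        if ratio >= MIN_RATIO:
            matches.append((a, b, ratio))
    return matches
```

The sketch also shows where false positives come from: two different books with nearly the same character count and the same opening pages (for example two editions sharing a long preface) would pass both checks, because nothing after the prefix is ever read.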


