9 pointsby novalis789 hours ago1 comment
  • pona-a3 hours ago
    I feel like normalization would be a nightmare. Consider all the mistranscriptions, OCR errors, and different names in the libraries (case, parentheticals, etc).

    If we assume there's no reliable way to define a book, maybe locally sensitive hashing could help find probably same books.

    The idea is pretty cool though.