1 pointby ibegtin5 hours ago1 comment
  • ibegtin5 hours ago
    Hi HN, I've made major progress in implementing Python lib, Iterable Data for reading and writing data files row by row in a consistent, iterator-based interface. It provides a unified API for working with various data formats (CSV, JSON, Parquet, XML, etc.) similar to csv.DictReader but supporting many more formats.

    I use it extensively inside our dataset search engine Dateno and it's helpful since I had to operate with wild range of data formats and something more or less universal was really needed.

    Repo: https://github.com/datenoio/iterabledata

    I'd love feedback from folks who reads, converts and processes data regularly and existing tools are not ideal for these tasks!

    Now I am lookng for