Streaming decompression

  • Sketched out streaming decompression of NDJSON archive files from PushShift in the BufferedRedditArchivDecoder class.
  • Allowed Reddit Submissions to have either Int32 or String created_utc fields (need coextend this to other Types)
  • Uncommented K-means clustering

Merge request reports

Loading