]> code.communitydata.science - cdsc_reddit.git/commit
Build comments dataset similarly to submissions and improve partitioning scheme
authorNate E TeBlunthuis <nathante@mox2.hyak.local>
Tue, 7 Jul 2020 18:45:43 +0000 (11:45 -0700)
committerNate E TeBlunthuis <nathante@mox2.hyak.local>
Tue, 7 Jul 2020 18:45:43 +0000 (11:45 -0700)
commit40d45637702fb51feb9f99ff7f6d71787af765ed
tree571c545a7092a45db24f7d250f08d5c06a7dd132
parentfc6575a28716f6d1611f988c48d15e64a22687ac
Build comments dataset similarly to submissions and improve partitioning scheme
comments_2_parquet.py [deleted file]
comments_2_parquet.sh [new file with mode: 0755]
comments_2_parquet_part1.py [new file with mode: 0755]
comments_2_parquet_part2.py [new file with mode: 0755]
helper.py [new file with mode: 0644]
submissions_2_parquet.sh
submissions_2_parquet_part1.py
submissions_2_parquet_part2.py

Community Data Science Collective || Want to submit a patch?