]> code.communitydata.science - cdsc_reddit.git/blobdiff - comments_2_parquet.py
Move the spark part of submissions_2_parquet to a separate script.
[cdsc_reddit.git] / comments_2_parquet.py
index bd853f72daf8899a371c06cde85150bf2fb3b418..8716abdccea4fc62d2d3bb01b3a7ce3ba2e8387c 100755 (executable)
@@ -1,5 +1,5 @@
-
 #!/usr/bin/env python3
+
 import pyspark
 from pyspark.sql import functions as f
 from pyspark.sql.types import *

Community Data Science Collective || Want to submit a patch?