]> code.communitydata.science - cdsc_reddit.git/blobdiff - run_tf_jobs.sh
Use groupby - joins instead of windows
[cdsc_reddit.git] / run_tf_jobs.sh
diff --git a/run_tf_jobs.sh b/run_tf_jobs.sh
new file mode 100755 (executable)
index 0000000..fc191d4
--- /dev/null
@@ -0,0 +1,8 @@
+#!/usr/bin/env bash
+module load parallel_sql
+source ../bin/activate
+python3 tf_comments.py gen_task_list
+psu --del --Y
+cat tf_task_list | psu --load
+
+for job in $(seq 1 50); do sbatch checkpoint_parallelsql.sbatch; done;

Community Data Science Collective || Want to submit a patch?