]> code.communitydata.science - mediawiki_dump_tools.git/commit
write regex captures to parquet arrays. redirects
authorNathan TeBlunthuis <nathante@uw.edu>
Wed, 30 Mar 2022 00:52:26 +0000 (17:52 -0700)
committerNathan TeBlunthuis <nathante@uw.edu>
Wed, 30 Mar 2022 00:52:26 +0000 (17:52 -0700)
commitb124f9c7c891b8b98441ef1b185ba1a1a4a32179
tree381ae49b919c766567dfedf9a87d883ad89bac06
parent32283aa4da2eb256af9bec2e2d42481a1ca19d0b
write regex captures to parquet arrays.
test/Wikiq_Unit_Test.py
test/baseline_output/basic_regextest_0.tsv
test/baseline_output/basic_regextest_1.tsv
test/baseline_output/basic_regextest_2.tsv
test/baseline_output/basic_regextest_3.tsv
test/baseline_output/capturegroup_regextest_0.tsv
test/baseline_output/capturegroup_regextest_1.tsv
test/baseline_output/redirect_pokemonfandomcom_fr-20200215-history.tsv
wikiq

Community Data Science Collective || Want to submit a patch?