code.communitydata.science - covid19.git/atom - wikipedia_views/resources/articles.txt history
software for the COVID19 Digital Observatory
https://code.communitydata.science/covid19.git
Community Data Science Collective
2020-04-01T14:15:12Z

changes to a bunch of the wikipedia view code 2020-04-01T14:15:12Z Benjamin Mako Hill mako@atdot.cc Benjamin Mako Hill mako@atdot.cc 2020-04-01T14:15:12Z https://code.communitydata.science/covid19.git/commitdiff/38fdd07b39f63de88dd985787eb2ac3a5866670c
changes to a bunch of the wikipedia view code

- Renamed the articles.txt to something more specific

Changes to both scripts:

- Updated filenames to match the new standard
- Reworked the logging code so that it can write to stderr by
  default. Because we can only call logging.basicConfig() once, this
  ended up being a bigger change.
- Caused scripts to output git commits and export to track which code
  produced which dataset.
- Caused programs to take files instead of directories as
  output (allows us to run programs more than once a day).
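
A minimal sketch of the logging and provenance points above, not the code from this commit. It assumes the script is run from inside a git checkout; the function names `logging_kwargs`, `init_logging`, and `get_git_commit` and the `destination` parameter are hypothetical. The keyword arguments are built first because `logging.basicConfig()` only takes effect on its first call, so the choice between stderr and a log file has to be made before that single call.

```python
import logging
import subprocess
import sys


def logging_kwargs(destination=None, level=logging.INFO):
    """Build the arguments for the single logging.basicConfig() call.

    With no destination, log records go to stderr; otherwise they go to
    the named file. (Hypothetical helper, not from the repository.)
    """
    kwargs = {"level": level, "format": "%(levelname)s:%(message)s"}
    if destination:
        kwargs["filename"] = destination
    else:
        kwargs["stream"] = sys.stderr
    return kwargs


def init_logging(destination=None, level=logging.INFO):
    """Configure the root logger exactly once, at program start."""
    logging.basicConfig(**logging_kwargs(destination, level))


def get_git_commit():
    """Return the current commit hash, or "unknown" if git is unavailable."""
    try:
        result = subprocess.run(
            ["git", "rev-parse", "HEAD"],
            capture_output=True,
            text=True,
            check=True,
        )
        return result.stdout.strip()
    except (OSError, subprocess.CalledProcessError):
        return "unknown"
```

A script would call `init_logging()` once near the top of its entry point and then log `get_git_commit()`, so that each dataset can be traced back to the code that produced it.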

Changes to the wikipedia_views/scripts/fetch_daily_views.py:

- Changed the output so that it is a sequence of JSON dictionaries
  (one per line), as per the standard we agreed to and which is what
  Twitter, GitHub, and other dumps do. The previous behavior was to
  output a single JSON list object.
- A number of other small changes and tweaks throughout.
  • [D] wikipedia_views/resources/articles.txt
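
The one-JSON-dictionary-per-line output described in the commit message above can be sketched as follows. This is an illustration under assumptions, not the code in fetch_daily_views.py: the helper names `write_json_lines` and `read_json_lines` and the record fields `article` and `views` are hypothetical.

```python
import io
import json


def write_json_lines(records, outfile):
    """Write each record as one JSON object per line."""
    for record in records:
        outfile.write(json.dumps(record) + "\n")


def read_json_lines(infile):
    """Read records back, one JSON object per non-empty line."""
    return [json.loads(line) for line in infile if line.strip()]


# Round-trip two hypothetical records through an in-memory file.
buffer = io.StringIO()
write_json_lines(
    [{"article": "Example_A", "views": 10}, {"article": "Example_B", "views": 7}],
    buffer,
)
buffer.seek(0)
records = read_json_lines(buffer)
```

Compared with a single JSON list, this layout lets a consumer process the file one line at a time, and lets a producer append records without rewriting the whole file.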
new version of this from scrape. no double quotes around articles any 2020-03-29T06:47:55Z Kaylea Champion kaylea@uw.edu Kaylea Champion kaylea@uw.edu 2020-03-29T06:47:55Z https://code.communitydata.science/covid19.git/commitdiff/fa8e977741938dd9958d0fb42f41cac7311584aa
new version of this from scrape. no double quotes around articles any
more
  • [D] wikipedia_views/resources/articles.txt
Reorganize wikipedia views subproject into subpackage. 2020-03-28T03:13:11Z Nathan TeBlunthuis nathante@uw.edu Nathan TeBlunthuis nathante@uw.edu 2020-03-28T03:13:11Z https://code.communitydata.science/covid19.git/commitdiff/c54d8ba28ac8fe22b3da3ee171277fbabffe64fb
Reorganize wikipedia views subproject into subpackage.
  • [D] wikipedia_views/resources/articles.txt