]> code.communitydata.science - cdsc_reddit.git/commitdiff
add note to try other tf normalization strategies. master
authorNathan TeBlunthuis <nathante@uw.edu>
Thu, 31 Mar 2022 19:17:16 +0000 (12:17 -0700)
committerNathan TeBlunthuis <nathante@uw.edu>
Thu, 31 Mar 2022 19:17:16 +0000 (12:17 -0700)
similarities/TODO [new file with mode: 0644]

diff --git a/similarities/TODO b/similarities/TODO
new file mode 100644 (file)
index 0000000..bc1e425
--- /dev/null
@@ -0,0 +1 @@
+Try normalizing tf by the mean or std instead of the max to avoid penalizing subreddits with very active users.

Community Data Science Collective || Want to submit a patch?