code.communitydata.science - cdsc_reddit.git/blobdiff - similarities/TODO
add note to try other tf normalization strategies.
diff --git a/similarities/TODO b/similarities/TODO
new file mode 100644 (file)
index 0000000..bc1e425
--- /dev/null
@@ -0,0 +1 @@
+Try normalizing tf by the mean or std instead of the max to avoid penalizing subreddits with very active users.
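
A minimal sketch of what this note proposes, not the repository's actual code. It assumes that tf is a raw per-subreddit count (for example, comments per author) and that the current scheme divides each count by the largest count in that subreddit. The function name `normalize_tf`, the `strategy` parameter, and the sample counts are hypothetical and chosen only for illustration.

```python
import numpy as np


def normalize_tf(counts, strategy="max"):
    """Scale raw counts for one subreddit by a summary statistic.

    Hypothetical helper: `strategy` selects the denominator
    ("max", "mean", or "std" of the counts in that subreddit).
    """
    counts = np.asarray(counts, dtype=float)
    if strategy == "max":
        scale = counts.max()
    elif strategy == "mean":
        scale = counts.mean()
    elif strategy == "std":
        scale = counts.std()  # population standard deviation (ddof=0)
    else:
        raise ValueError(f"unknown strategy: {strategy!r}")
    if scale == 0:
        # All-zero or constant counts leave nothing to divide by.
        raise ValueError("scale is zero; cannot normalize")
    return counts / scale


# Made-up counts: three ordinary users plus one very active user.
with_power_user = [5, 4, 6, 500]

for strategy in ("max", "mean", "std"):
    print(strategy, normalize_tf(with_power_user, strategy))
```

With max normalization the single very active user sets the denominator, so every other user's weight in that subreddit shrinks toward zero. The mean and std are still pulled upward by the outlier, but less so, which is why they are worth trying as alternatives.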
