@@ -15,3 +15,3 @@ Based on [[https://schlockmercenary.fandom.com/wiki/The_Seventy_Maxims_of_Maxima
*. Do unto others’ hyperparameters as you would have them do unto yours.
-*. “Innovative architecture” means never asking “did we implement the baseline correctly?”
+*. “Innovative architecture” means never asking “did we implement a proper baseline?”
*. Only you can prevent vanishing gradients.
@@ -22,3 +22,3 @@ Based on [[https://schlockmercenary.fandom.com/wiki/The_Seventy_Maxims_of_Maxima
*. If you’re not willing to prune your own layers, you’re not willing to deploy.
-*. Give a model a labeled dataset, and it trains for a day. Take its labels away and call it “self-supervised,” and it’ll generate new ones for you to validate tomorrow.
+*. Give a model a labeled dataset, and it trains for a day. Take its labels away and call it “self-supervised” and it’ll generate new ones for you to validate tomorrow.
*. If you’re manually labeling data, somebody’s done something wrong.