feat(logloss) init

stared · stared · commit 83ca90a0159c · 2019-12-16T13:42:38.000+01:00
diff --git a/7 Log loss.ipynb b/7 Log loss.ipynb
@@ -0,0 +1,106 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Thinking in tensors, writing in PyTorch\n",
+    "\n",
+    "Hands-on training by [Piotr Migdał](https://p.migdal.pl) (2019). Version for Uniwersytet Śląski.\n",
+    "\n",
+    "**WORK IN PROGRESS**\n",
+    "\n",
+    "\n",
+    "## Log loss\n",
+    "\n",
+    "Multi-class logistic regression can be expressed as a shallow neural network consisting of one linear layer and a softmax activation function.\n",
+    "\n",
+    "For binary classification, we can use the sigmoid (a.k.a. logistic) function:\n",
+    "\n",
+    "$$ \\sigma(x) = \\frac{1}{1+\\exp(-x)} $$\n",
+    "\n",
+    "The softmax function transforms any real-valued vector into a probability distribution (values in the range (0, 1) that sum to 1):\n",
+    "$$\\text{softmax}(x_i) = \\frac{\\exp(x_i)}{\\sum_j \\exp(x_j)}$$\n",
+    "\n",
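+    "We use the cross-entropy loss function, where $p_{j, true}$ is the true probability of class $j$ (1 for the correct class and 0 otherwise when labels are one-hot) and $p_{j, pred}$ is the predicted probability:\n",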
+    "$$- \\sum_j p_{j, true} \\log(p_{j, pred})$$\n",
+    "\n",
+    "Note that we do not explicitly apply the softmax function in the model class below: [torch.nn.CrossEntropyLoss](https://pytorch.org/docs/stable/nn.html#torch.nn.CrossEntropyLoss) expects raw scores (logits) and applies log-softmax internally.\n",
+    "\n",
+    "See also:\n",
+    "\n",
+    "* [Cross-entropy vs. mean-squared error loss](https://www.reddit.com/r/MachineLearning/comments/8im9eb/d_crossentropy_vs_meansquared_error_loss/)\n",
+    "* [Understanding binary cross-entropy / log loss: a visual explanation](https://towardsdatascience.com/understanding-binary-cross-entropy-log-loss-a-visual-explanation-a3ac6025181a)\n",
+    "* [Cross entropy](https://pandeykartikey.github.io/machine/learning/basics/2018/05/22/cross-entropy.html) - another explanation\n",
+    "* [Softmax function](https://en.wikipedia.org/wiki/Softmax_function)\n",
+    "* [Multiclass logistic regression](https://en.wikipedia.org/wiki/Multinomial_logistic_regression)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import numpy as np"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": []
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python [conda env:py37]",
+   "language": "python",
+   "name": "conda-env-py37-py"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.7.2"
+  },
+  "varInspector": {
+   "cols": {
+    "lenName": 16,
+    "lenType": 16,
+    "lenVar": 40
+   },
+   "kernels_config": {
+    "python": {
+     "delete_cmd_postfix": "",
+     "delete_cmd_prefix": "del ",
+     "library": "var_list.py",
+     "varRefreshCmd": "print(var_dic_list())"
+    },
+    "r": {
+     "delete_cmd_postfix": ") ",
+     "delete_cmd_prefix": "rm(",
+     "library": "var_list.r",
+     "varRefreshCmd": "cat(var_dic_list()) "
+    }
+   },
+   "types_to_exclude": [
+    "module",
+    "function",
+    "builtin_function_or_method",
+    "instance",
+    "_Feature"
+   ],
+   "window_display": false
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
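
The three formulas in the markdown cell (sigmoid, softmax, cross-entropy) can be checked numerically with NumPy, which the first code cell already imports. The following is a minimal sketch under stated assumptions, not code from this commit: the function names `sigmoid`, `softmax` and `cross_entropy`, the `eps` clipping constant, and the example logits are all hypothetical choices made for illustration.

```python
import numpy as np


def sigmoid(x):
    # Logistic function: 1 / (1 + exp(-x)).
    return 1.0 / (1.0 + np.exp(-x))


def softmax(x):
    # Subtracting the maximum does not change the result,
    # but it avoids overflow in exp for large inputs.
    shifted = x - np.max(x, axis=-1, keepdims=True)
    exps = np.exp(shifted)
    return exps / np.sum(exps, axis=-1, keepdims=True)


def cross_entropy(p_true, p_pred, eps=1e-12):
    # -sum_j p_true[j] * log(p_pred[j]); clipping (an assumption made
    # here) keeps log away from zero.
    return -np.sum(p_true * np.log(np.clip(p_pred, eps, 1.0)), axis=-1)


# Hypothetical example: three classes, the first one is correct.
logits = np.array([2.0, 1.0, 0.1])
p_pred = softmax(logits)
p_true = np.array([1.0, 0.0, 0.0])
loss = cross_entropy(p_true, p_pred)

print("predicted probabilities:", p_pred)
print("cross-entropy loss:", loss)
```

With a one-hot target the sum collapses to a single term, so the loss equals the negative log of the probability assigned to the correct class. For two classes with logits `[x, 0]`, the first softmax output coincides with `sigmoid(x)`, which is why the binary case can use the sigmoid alone.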