Commit ce236744 authored by auphelia

[Notebook] Added section about preparation of exported ONNX model to process it in FINN

parent 462853ab
%% Cell type:markdown id: tags:
# FINN - End-to-End Flow
-----------------------------------------------------------------
This notebook gives an overview of the end-to-end flow of FINN: from loading an ONNX model exported from Brevitas, through the numerous transformations in FINN, up to the generation of a bitstream that can be used to program an FPGA.
We'll use the following showSrc function to print the source code of function calls in the Jupyter notebook.
%% Cell type:code id: tags:
``` python
import inspect
def showSrc(what):
print("".join(inspect.getsourcelines(what)[0]))
```
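%% Cell type:markdown id: tags:
For example, showSrc can be applied to any function whose source is available on disk. The call below (a usage sketch, not part of the original notebook) prints the source of `inspect.getsourcelines` itself.
%% Cell type:code id: tags:
``` python
# Print the source of a standard-library function as a quick demonstration
showSrc(inspect.getsourcelines)
```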
%% Cell type:markdown id: tags:
## Outline
-------------
1. Preparation of model to pass to FINN
2. FINN transformations
    * first transformations
    * Streamline
    * last transformations
3. Verification
4. Bitstream generation
%% Cell type:markdown id: tags:
### 1. Preparation of model to pass to FINN
FINN expects an ONNX model as input. This can be a model trained with [Brevitas](https://github.com/Xilinx/brevitas). Brevitas is a PyTorch library for quantization-aware training, and the FINN Docker image comes with several [example Brevitas networks](https://github.com/maltanar/brevitas_cnv_lfc). To show the FINN end-to-end flow, we'll use the LFC-w1a1 model as an example network. The Brevitas export is only briefly described here; for details see the Jupyter notebook [3-FINN-Brevitas-network-import](3-FINN-Brevitas-network-import.ipynb).
First, a few things have to be imported. Then the model can be loaded with the pretrained weights.
%% Cell type:code id: tags:
``` python
import torch
import brevitas.onnx as bo
from models.LFC import LFC
# instantiate the LFC topology with 1-bit weights and activations
lfc = LFC(weight_bit_width=1, act_bit_width=1, in_bit_width=1)
# load the pretrained weights from the brevitas_cnv_lfc checkpoint
trained_lfc_checkpoint = ("/workspace/brevitas_cnv_lfc/pretrained_models/LFC_1W1A/checkpoints/best.tar")
checkpoint = torch.load(trained_lfc_checkpoint, map_location="cpu")
lfc.load_state_dict(checkpoint["state_dict"])
# export the model to FINN-ONNX with a (1, 1, 28, 28) input shape (MNIST-sized)
bo.export_finn_onnx(lfc, (1, 1, 28, 28), "lfc_w1_a1.onnx")
```
%% Output
/workspace/brevitas_cnv_lfc/training_scripts/models/LFC.py:73: TracerWarning: torch.tensor results are registered as constants in the trace. You can safely ignore this warning if you use this function to create tensors out of constant variables that would be the same every time you call this function. In any other case, this might cause the trace to be incorrect.
x = 2.0 * x - torch.tensor([1.0])
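%% Cell type:markdown id: tags:
As a quick sanity check (not part of the original notebook), the exported file can be opened with the onnx package to confirm it was written correctly.
%% Cell type:code id: tags:
``` python
import onnx
# Load the exported model and print a few basic properties of its graph
exported_model = onnx.load("lfc_w1_a1.onnx")
print("Number of nodes:", len(exported_model.graph.node))
print("Model input:", exported_model.graph.input[0].name)
```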
%% Cell type:markdown id: tags:
The model has now been exported, loaded with the pretrained weights and saved under the name "lfc_w1_a1.onnx".
To visualize the exported model, Netron can be used. Netron is a visualizer for neural networks and allows interactive investigation of network properties. For example, you can click on individual nodes and view their properties.
%% Cell type:code id: tags:
``` python
import netron
netron.start("lfc_w1_a1.onnx", port=8081, host="0.0.0.0")
```
%% Output
Stopping http://0.0.0.0:8081
Serving 'lfc_w1_a1.onnx' at http://0.0.0.0:8081
%% Cell type:code id: tags:
``` python
%%html
<iframe src="http://0.0.0.0:8081/" style="position: relative; width: 100%;" height="400"></iframe>
```
%% Output
%% Cell type:markdown id: tags:
Now that we have the model in .onnx format, we can work with it using FINN. For this, the FINN `ModelWrapper` is used. It is a wrapper around the ONNX model that provides several helper functions to make it easier to work with the model. For details see the Jupyter notebook [2-FINN-ModelWrapper](2-FINN-ModelWrapper.ipynb).
%% Cell type:code id: tags:
``` python
from finn.core.modelwrapper import ModelWrapper
model = ModelWrapper("lfc_w1_a1.onnx")
```
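%% Cell type:markdown id: tags:
To illustrate (a small sketch, not part of the original notebook; the helper names follow the FINN API covered in notebook 2), the wrapped ONNX graph and a few convenience functions can be accessed directly.
%% Cell type:code id: tags:
``` python
# The underlying ONNX graph is exposed through .graph; helper functions
# such as get_tensor_shape() look up properties of individual tensors.
print("Number of nodes:", len(model.graph.node))
input_name = model.graph.input[0].name
print("Input shape:", model.get_tensor_shape(input_name))
```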
%% Cell type:markdown id: tags:
Now the model is prepared and can be processed in different ways. FINN is built around analysis and transformation passes that can be applied to the model. An analysis pass extracts specific information about the model and returns it to the user in the form of a dictionary; for more details see [4-FINN-HowToAnalysisPass](4-FINN-HowToAnalysisPass.ipynb). A transformation pass changes the model and returns the changed model back to the FINN flow; for more information about transformation passes see notebook [5-FINN-HowToTransformationPass](5-FINN-HowToTransformationPass.ipynb).
Since the goal of this notebook is to process the model to the point where a bitstream can be generated from it, the focus is on the transformations necessary for this. These are discussed in more detail in the next section.
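As a short illustration (a sketch, not part of the original notebook; InferShapes is one of the basic transformations covered in notebook 5), a transformation pass is applied by passing it to the model's transform() method, which returns the transformed model.
%% Cell type:code id: tags:
``` python
from finn.transformation.infer_shapes import InferShapes
# Apply a simple shape-inference pass; transform() returns a new,
# modified ModelWrapper instance rather than changing the model in place.
model = model.transform(InferShapes())
```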
%% Cell type:code id: tags:
``` python
```