<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Machine Learning on Joris Bukala | Math &amp; ML</title>
    <link>https://jorisbukala.com/categories/machine-learning/</link>
    <description>Recent content in Machine Learning on Joris Bukala | Math &amp; ML</description>
    <generator>Hugo -- 0.153.4</generator>
    <language>en-us</language>
    <lastBuildDate>Wed, 01 Nov 2023 19:54:22 +0100</lastBuildDate>
    <atom:link href="https://jorisbukala.com/categories/machine-learning/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Understanding Neural Networks</title>
      <link>https://jorisbukala.com/posts/understanding-nns/</link>
      <pubDate>Wed, 01 Nov 2023 19:54:22 +0100</pubDate>
      <guid>https://jorisbukala.com/posts/understanding-nns/</guid>
      <description>&lt;h2 id=&#34;understanding-of-neural-nets-from-first-principles-brain-dump&#34;&gt;Understanding of Neural nets from first-principles: Brain dump&lt;/h2&gt;
&lt;p&gt;So I was reading my company&amp;rsquo;s IT newsletter the other day where one of the topics was sparse modeling (&lt;a href=&#34;https://www.forbes.com/sites/johnwerner/2023/08/17/sparse-models-the-math-and-a-new-theory-for-ground-breaking-ai/&#34;&gt;discussing this Forbes article&lt;/a&gt;) and it got me thinking again about some things I was reading the past months, about trying to understand how and why (mainly) Deep Learning works.&lt;/p&gt;
&lt;h3 id=&#34;first-principles&#34;&gt;First-principles&lt;/h3&gt;
&lt;p&gt;In a way, sure we know how it works on the microscopic level of each individual neuron (activation functions, matrix multiplications, gradient descent and all that), and we also often describe it at a high level (where we tend to greatly anthropomorphize it: &amp;ldquo;the model learned to do &lt;em&gt;X&lt;/em&gt; because in all its examples it saw this object from the same angle, ..&amp;rdquo;). But there are many questions in between where it seems we never connected the dots, instead relying on empiricism, often crude observations and post-hoc justifications for choices here:&lt;/p&gt;</description>
    </item>
  </channel>
</rss>
