
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Alessandro Iannacone – DevOps Engineer &amp; Cloud Specialist</title>
      <link>https://www.iannaconealessandro.it/blog</link>
      <description>Sito personale di Alessandro Iannacone, DevOps Engineer specializzato in cloud, automazione e infrastrutture IT complesse.</description>
      <language>it-IT</language>
      <managingEditor>me@iannacone.it (Alessandro Iannacone)</managingEditor>
      <webMaster>me@iannacone.it (Alessandro Iannacone)</webMaster>
      <lastBuildDate>Mon, 01 Jun 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.iannaconealessandro.it/tags/mlops/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.iannaconealessandro.it/blog/oltre-i-pod-k8s-serving-llm-multi-trilioni-parametri</guid>
    <title>Oltre i Pod: come Kubernetes gestisce il serving di LLM multi-trilioni di parametri</title>
    <link>https://www.iannaconealessandro.it/blog/oltre-i-pod-k8s-serving-llm-multi-trilioni-parametri</link>
    <description>Kubernetes non esegue i calcoli del modello: orchestra risorse GPU, rete e scheduling. Vediamo come tensor, pipeline ed expert parallelism rendono possibile il serving di LLM enormi.</description>
    <pubDate>Mon, 01 Jun 2026 00:00:00 GMT</pubDate>
    <author>me@iannacone.it (Alessandro Iannacone)</author>
    <category>kubernetes</category><category>llm</category><category>gpu</category><category>ai infrastructure</category><category>mlops</category>
  </item>

    </channel>
  </rss>
