2 comments

  • reconnecting 8 minutes ago
    Discussion on reddit: https://www.reddit.com/r/LocalLLaMA/comments/1rewis9/removed_by_moderator/
  • medi_naseri 4 hours ago
    This is so freaking awesome. I am working on a project trying to run 10 models on two GPUs, and loading/offloading is the only solution I have in mind.

    Will try getting this deployed.

    Are the advertised cold start timings for a condition where no other model is loaded on the GPUs?