<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Model Support for Video LLama in Intel® Gaudi® AI Accelerator</title>
    <link>https://community.intel.com/t5/Intel-Gaudi-AI-Accelerator/Model-Support-for-Video-LLama/m-p/1678447#M73</link>
    <description>&lt;P&gt;My research on supporting HPU on&amp;nbsp;&lt;A href="https://github.com/opea-project/GenAIComps/blob/main/comps/lvms/src/integrations/video_llama.py" target="_blank"&gt;https://github.com/opea-project/GenAIComps/blob/main/comps/lvms/src/integrations/video_llama.py -&amp;nbsp;&lt;/A&gt;found the following:&lt;/P&gt;&lt;P&gt;The task is to adapt the video-lama server to HPU the following needs to occur:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;A Dockerfile like the llava Dockerfile.hpu docker file must be created for video-llama integration using the latest Intel Gaudi software pytorch container as the base container. This should be integrated into this directory: GenAIComps/tree/main/comps/lvms/src/integrations/dependency/video-llama&lt;/LI&gt;&lt;LI&gt;The server.py file for video-llama needs to be modified to use the hpu (look at GenAIComps/blob/main/comps/lvms/src/integrations/dependency/llava/llava_server.py for insight).&lt;/LI&gt;&lt;LI&gt;The current model specified in video_llama_eval_only_vl.yaml is: /home/user/model/Video-LLaMA-2-7B-Finetuned/llama-2-7b-chat-hf which is a fine tuned version of meta-llama/Llama-2-7b-chat-hf. This model is a supported model on Gaudi, so the finetuned model should be supported on Gaudi as well.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;From what I can tell there is nothing blocking the implementation of the video-llama service on a Gaudi using the Intel Gaudi software. Do you have specific concern that needs to be addressed or are you just interested in the OPEA team doing the integration?&lt;/P&gt;</description>
    <pubDate>Thu, 27 Mar 2025 18:39:34 GMT</pubDate>
    <dc:creator>James_Edwards</dc:creator>
    <dc:date>2025-03-27T18:39:34Z</dc:date>
    <item>
      <title>Model Support for Video LLama</title>
      <link>https://community.intel.com/t5/Intel-Gaudi-AI-Accelerator/Model-Support-for-Video-LLama/m-p/1675612#M56</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Currently there is a VideoQna sample on OPEA that runs on xeon. THere is customer interest for thsi use case on accelerator. Looking for model support on gaudi for this microservice :&amp;nbsp;&lt;A href="https://github.com/opea-project/GenAIComps/blob/main/comps/lvms/src/integrations/video_llama.py" target="_blank"&gt;https://github.com/opea-project/GenAIComps/blob/main/comps/lvms/src/integrations/video_llama.py&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 17 Mar 2025 19:15:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Gaudi-AI-Accelerator/Model-Support-for-Video-LLama/m-p/1675612#M56</guid>
      <dc:creator>Srinarayan</dc:creator>
      <dc:date>2025-03-17T19:15:15Z</dc:date>
    </item>
    <item>
      <title>Re: Model Support for Video LLama</title>
      <link>https://community.intel.com/t5/Intel-Gaudi-AI-Accelerator/Model-Support-for-Video-LLama/m-p/1678447#M73</link>
      <description>&lt;P&gt;My research on supporting HPU on&amp;nbsp;&lt;A href="https://github.com/opea-project/GenAIComps/blob/main/comps/lvms/src/integrations/video_llama.py" target="_blank"&gt;https://github.com/opea-project/GenAIComps/blob/main/comps/lvms/src/integrations/video_llama.py -&amp;nbsp;&lt;/A&gt;found the following:&lt;/P&gt;&lt;P&gt;The task is to adapt the video-lama server to HPU the following needs to occur:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;A Dockerfile like the llava Dockerfile.hpu docker file must be created for video-llama integration using the latest Intel Gaudi software pytorch container as the base container. This should be integrated into this directory: GenAIComps/tree/main/comps/lvms/src/integrations/dependency/video-llama&lt;/LI&gt;&lt;LI&gt;The server.py file for video-llama needs to be modified to use the hpu (look at GenAIComps/blob/main/comps/lvms/src/integrations/dependency/llava/llava_server.py for insight).&lt;/LI&gt;&lt;LI&gt;The current model specified in video_llama_eval_only_vl.yaml is: /home/user/model/Video-LLaMA-2-7B-Finetuned/llama-2-7b-chat-hf which is a fine tuned version of meta-llama/Llama-2-7b-chat-hf. This model is a supported model on Gaudi, so the finetuned model should be supported on Gaudi as well.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;From what I can tell there is nothing blocking the implementation of the video-llama service on a Gaudi using the Intel Gaudi software. Do you have specific concern that needs to be addressed or are you just interested in the OPEA team doing the integration?&lt;/P&gt;</description>
      <pubDate>Thu, 27 Mar 2025 18:39:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Gaudi-AI-Accelerator/Model-Support-for-Video-LLama/m-p/1678447#M73</guid>
      <dc:creator>James_Edwards</dc:creator>
      <dc:date>2025-03-27T18:39:34Z</dc:date>
    </item>
  </channel>
</rss>

