ComfyUIとHunyuanVideoでローカル動画生成(I2V)

いつの間にかローカル動画生成が進化していました。
今回はComfyUIのインストールから、HunyuanVideoを使ってImage2Videoを実行するまでを書いていきます。

はじめに

実行環境

Windows 11
NVIDIA RTX 4080 16GB

使うもの

Git for Windows

gitforwindows.org

Git for Windows

https://gitforwindows.org

We bring the awesome Git VCS to Windows

7zip

7-zip.opensource.jp

7-Zip

https://7-zip.opensource.jp

7-Zipは世界的にデファクトスタンダードのフリーの圧縮・展開 / 圧縮・解凍ソフトです。7z、zip、rar、lzh、ISO、tar、dmg、msiなど、さまざまな圧縮・データフォーマットに1つのソフトウェアで対応し、AES256による暗号化（パスワード圧縮）も可能です。

PowerShell7

必要ではありませんが、cmd より使いやすいのでおすすめです。

learn.microsoft.com

Windows への PowerShell のインストール - PowerShell

https://learn.microsoft.com/ja-jp/powershell/scripting/install/installing-powershell-on-windows?view=powershell-7.4#msi

Windows への PowerShell のインストールに関する情報

CUDA Toolkit

現時点(2025/02)で最新は12.8。

NVIDIA Developer

CUDA Toolkit 12.1 Downloads

https://developer.nvidia.com/cuda-downloads?target_os=Windows&target_arch=x86_64&target_version=11&target_type=exe_network

Get the latest feature updates to NVIDIA's proprietary compute stack.

12.4以上が必要です。
インストール済みの方も、12.4以上か確認してください。nvcc -V で確認できます。

PowerShell

> nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2025 NVIDIA Corporation
Built on Wed_Jan_15_19:38:46_Pacific_Standard_Time_2025
Cuda compilation tools, release 12.8, V12.8.61
Build cuda_12.8.r12.8/compiler.35404655_0

Visual Studio

wheel のビルドに必要です。

Visual Studio

無料の開発者ソフトウェアとサービス - Visual Studio

https://visualstudio.microsoft.com/ja/free-developer-offers

無料プラン: Visual Studio Community、Visual Studio Code、VSTS、Dev Essentials。

Visual Studio Community のインストーラーをダウンロードし、実行してください。
必要なものは「C++ によるデスクトップ開発」です。

Visual Studioインストーラーで「C++ によるデスクトップ開発」にチェックを入れている画像

ComfyUIのインストール

ComfyUIのダウンロード

GitHub

Releases · comfyanonymous/ComfyUI

https://github.com/comfyanonymous/ComfyUI/releases

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. - comfyanonymous/ComfyUI

ポータブル版をダウンロードして解凍します。

ComUI_windows_portable_nvidia.7zの入手場所を説明する画像

ComfyUI-Managerのインストール

解凍先の ComfyUI_windows_portable/ComfyUI/custom_nodes をターミナルで開いたら以下のコマンドを実行し、ComfyUI-Managerをクローンします。

PowerShell

git clone https://github.com/ltdrdata/ComfyUI-Manager comfyui-manager

ComfyUI-Managerのインストールの確認

ComfyUI_windows_portable/run_nvidia_gpu.bat を実行すると、http://127.0.0.1:8188/ がブラウザで開かれます。

右上にComfyUI-Managerのアイコンが表示されていればOK。

必要パッケージのインストール

SageAttention2 をインストールする必要があります。
必要要件は以下のとおりです。

python>=3.9
torch>=2.3.0
CUDA>=12.4
triton>=3.0.0

まずバージョン確認を行います。

Pythonのバージョン確認 python.exe -V

PowerShell

> ..\python_embeded\python.exe -V
Python 3.12.8

CUDAのバージョン確認 nvcc -V

PowerShell

> nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2025 NVIDIA Corporation
Built on Wed_Jan_15_19:38:46_Pacific_Standard_Time_2025
Cuda compilation tools, release 12.8, V12.8.61
Build cuda_12.8.r12.8/compiler.35404655_0

torchと諸々のインストール

ComfyUI_windows_portable/update フォルダに移動し、以下のコマンドを実行して必要パッケージをインストールします。

PowerShell

..\python_embeded\python.exe -s -m pip install "accelerate >= 1.1.1"
..\python_embeded\python.exe -s -m pip install "diffusers >= 0.31.0"
..\python_embeded\python.exe -s -m pip install "transformers >= 4.39.3"
..\python_embeded\python.exe -s -m pip install ninja
..\python_embeded\python.exe -s -m pip install --upgrade torch torchvision torchaudio xformers==0.0.29.post3 --index-url https://download.pytorch.org/whl/cu124

tritonのインストール

tritonパッケージのインストール

woct0rdho/triton-windows から最新のリリースをダウンロードします。

GitHub

Releases · woct0rdho/triton-windows

https://github.com/woct0rdho/triton-windows/releases

Fork of the Triton language and compiler for Windows support - woct0rdho/triton-windows

先ほど確認したPythonのバージョンに合わせてダウンロードしてください。私の場合は Python 3.12.8 だったので、triton-3.2.0-cp312-cp312-win_amd64.whl をダウンロードしました。

ダウンロードした .whl ファイルを ComfyUI_windows_portable/update に置き、以下のコマンドを実行しインストールます。※バージョン部分は適宜変更

PowerShell

..\python_embeded\python.exe -s -m pip install triton-3.2.0-cp312-cp312-win_amd64.whl

includeとlibsの配置

python_3.xx.x_include_libs.zipの場所を説明する画像

woct0rdho/triton-windows から python_3.12.7_include_libs.zip をダウンロードします。
(Python3.11系の場合は python_3.11.9_include_libs.zip)

GitHub

Release v3.0.0-windows.post1 ?? woct0rdho/triton-windows

https://github.com/woct0rdho/triton-windows/releases/tag/v3.0.0-windows.post1

More ways to find MSVC and Python

ダウンロードしたファイルを解凍し、include と libs を ComfyUI_windows_portable/python_embeded にコピーします。

includeとlibsをpython_embededにドラッグアンドドロップする画像

SageAttentionのインストール

Comfyi-windows-portable フォルダに移動し、以下のコマンドを実行しインストールします。

PowerShell

 git clone https://github.com/thu-ml/SageAttention 
 cd SageAttention
 ..\python_embeded\python.exe -m pip install .

bitsandbytesのインストール

4bit/8bit量子化を行うために必要です。

ComfyUI_windows_portable/update フォルダに移動し、以下のコマンドを実行します。

PowerShell

..\python_embeded\python.exe -s -m pip install bitsandbytes>=0.45.1

テスト

ここまでのインストールがうまくいっているか確認します。

ComfyUI_windows_portable/test.py を作成し、次のコードを記述してください。

Python

import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, output_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)
    block_start = pid * BLOCK_SIZE
    offsets = block_start + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    output = x + y
    tl.store(output_ptr + offsets, output, mask=mask)

def add(x: torch.Tensor, y: torch.Tensor):
    output = torch.empty_like(x)
    n_elements = output.numel()
    grid = lambda meta: (triton.cdiv(n_elements, meta["BLOCK_SIZE"]),)
    add_kernel[grid](x, y, output, n_elements, BLOCK_SIZE=1024)
    return output

a = torch.rand(3, device="cuda")
b = a + a
b_compiled = add(a, a)
print(b_compiled - b)
print("If you see tensor([0., 0., 0.], device='cuda:0'), then it works")

実行

PowerShell

.\python_embeded\python.exe .\test.py

tensor([0., 0., 0.], device='cuda:0') と表示されれば大丈夫です。

もし、ImportError: DLL load failed while importing cuda_utils: 指定されたモジュールが見つかりません。 と表示された場合は、C:\Users\ユーザー名\.triton フォルダを削除し、再度実行してみてください。

学習済みモデルのダウンロード

VAEのダウンロード

hunyuan_video_vae_bf16.safetensors

huggingface.co

hunyuan_video_vae_bf16.safetensors · Kijai/HunyuanVideo_comfy at main

https://huggingface.co/Kijai/HunyuanVideo_comfy/blob/main/hunyuan_video_vae_bf16.safetensors

ファイルは下記のフォルダ内に置きます( hyvid フォルダがない場合は作成)。

ComfyUI_windows_portable\ComfyUI\models\vae\hyvid

Hunyuan Video model のダウンロード

hunyuan_video_720_cfgdistill_fp8_e4m3fn.safetensors
hunyuan_video_FastVideo_720_fp8_e4m3fn.safetensors

huggingface.co

hunyuan_video_720_cfgdistill_fp8_e4m3fn.safetensors · Kijai/HunyuanVideo_comf...

https://huggingface.co/Kijai/HunyuanVideo_comfy/blob/main/hunyuan_video_720_cfgdistill_fp8_e4m3fn.safetensors

huggingface.co

hunyuan_video_FastVideo_720_fp8_e4m3fn.safetensors · Kijai/HunyuanVideo_comfy...

https://huggingface.co/Kijai/HunyuanVideo_comfy/blob/main/hunyuan_video_FastVideo_720_fp8_e4m3fn.safetensors

下記のフォルダ内に置きます。

ComfyUI_windows_portable\ComfyUI\models\diffusion_models\hyvid

Leapfusion Hunyuan Image-to-Video Lora のダウンロード

今回はImage2Videoを試したいのでLeapfusion Hunyuan Image-to-Video Lora weightsをダウンロードします。

img2vid544p.safetensors

huggingface.co

img2vid544p.safetensors · leapfusion-image2vid-test/image2vid-960x544 at main

https://huggingface.co/leapfusion-image2vid-test/image2vid-960x544/blob/main/img2vid544p.safetensors

下記のフォルダ内に置きます。

ComfyUI_windows_portable\ComfyUI\models\loras\hyvid

実行

必要ノードのインストール

ComfyUI_windows_portable/run_nvidia_gpu.bat を実行し、ComfyUIの画面を開いておいてください。

Civitai にある、下記ワークフローを使います。

civitai.com

Img2Vid 𝑽𝟐 ▪ Hunyuan ▪ LeapFusion Lora V2 - Hun I2V | lora v2 | 1.0 | Hunyu...

https://civitai.com/models/1180764/img2vid-hunyuan-leapfusion-lora-v2

HUNYUAN | Img 2 Vid LeapFusion Requirements: 1) Kijai's nodes 2) LeapFusion Lora v2 (544p) or v1 (320p) YOU MUST UPDATE KIJAI NODES IF YOU HAVE THE...

zipファイルを解凍し、Hunyuan-Img2Vid-LeapFusion v2 1.0.json をComfyUIのブラウザ画面にドラッグ&ドロップします。

必要なカスタムノードがインストールされていないので、警告が出ます。

右上のComfyUI-Managerのアイコンをクリックし、「Install Missing Custom Nodes」をクリックします。

「Install」をクリックし、すべてインストールします。※バージョンはlatestを選択しました。

Comfy UI の再起動を促されるので、左下の「Restart」をクリックして再起動します。

再起動が終わり、ブラウザ画面がリフレッシュされるとエラーが消えていると思います。

各ノードの設定

生成前に各ノードの設定を行います。
設定は一例なので参考程度に。

IMG2VID LORA ノード
- lora: hyvid/img2vid544p.safetensors
HunyuanVideo Model Loader
- model: hyvid/hunyuan_video_720_FastVideo_fp8_e4m3fn.safetensors
- auto_cpu_offload: True
VAE
- model_name: hyvid/hunyuan_video_vae_bf16.safetensors
(Down)Load HunyuanVideo TextEncoder
- precision: bf16
- quantization: bnb_nf4
HunyuanVideo Sampler
- steps: 6
- embedded_guidance_scale: 6
- flow_shift: 20
HunyuanVideo Enhance A Video
- weight: 8.00
- start_percent: 0.0
- end_percent: 0.8
HunyuanVideo Decode
- spatial_tile_sample_min_size: 160 ※VRAMに合わせて調整