
FaceFusion
==========

> Next generation face swapper and enhancer.

[Build Status](https://github.com/facefusion/facefusion/actions?query=workflow:ci)


Preview
-------


Installation
------------

Be aware that the installation requires technical skills and is not intended for beginners. Please do not open platform or installation related issues on GitHub. Our [Discord](https://join.facefusion.io) community is very helpful and will guide you through the installation.

Get started with the [installation](https://docs.facefusion.io/installation) guide.


Usage
-----

Run the command:

```
python run.py [options]

options:
  -h, --help
        show this help message and exit
  -s SOURCE_PATHS, --source SOURCE_PATHS
        choose single or multiple source images or audios
  -t TARGET_PATH, --target TARGET_PATH
        choose single target image or video
  -o OUTPUT_PATH, --output OUTPUT_PATH
        specify the output file or directory
  -v, --version
        show program's version number and exit

misc:
  --force-download
        force automated downloads and exit
  --skip-download
        omit automated downloads and remote lookups
  --headless
        run the program without a user interface
  --log-level {error,warn,info,debug}
        adjust the message severity displayed in the terminal

execution:
  --execution-providers EXECUTION_PROVIDERS [EXECUTION_PROVIDERS ...]
        accelerate the model inference using different providers (choices: cpu, ...)
  --execution-thread-count [1-128]
        specify the number of parallel threads while processing
  --execution-queue-count [1-32]
        specify the number of frames each thread is processing

memory:
  --video-memory-strategy {strict,moderate,tolerant}
        balance fast frame processing and low VRAM usage
  --system-memory-limit [0-128]
        limit the available RAM that can be used while processing

face analyser:
  --face-analyser-order {left-right,right-left,top-bottom,bottom-top,small-large,large-small,best-worst,worst-best}
        specify the order in which the face analyser detects faces
  --face-analyser-age {child,teen,adult,senior}
        filter the detected faces based on their age
  --face-analyser-gender {female,male}
        filter the detected faces based on their gender
  --face-detector-model {many,retinaface,scrfd,yoloface,yunet}
        choose the model responsible for detecting the face
  --face-detector-size FACE_DETECTOR_SIZE
        specify the size of the frame provided to the face detector
  --face-detector-score [0.0-1.0]
        filter the detected faces based on the confidence score
  --face-landmarker-score [0.0-1.0]
        filter the detected landmarks based on the confidence score

face selector:
  --face-selector-mode {many,one,reference}
        use reference based tracking or simple matching
  --reference-face-position REFERENCE_FACE_POSITION
        specify the position used to create the reference face
  --reference-face-distance [0.0-1.5]
        specify the desired similarity between the reference face and target face
  --reference-frame-number REFERENCE_FRAME_NUMBER
        specify the frame used to create the reference face

face mask:
  --face-mask-types FACE_MASK_TYPES [FACE_MASK_TYPES ...]
        mix and match different face mask types (choices: box, occlusion, region)
  --face-mask-blur [0.0-1.0]
        specify the degree of blur applied to the box mask
  --face-mask-padding FACE_MASK_PADDING [FACE_MASK_PADDING ...]
        apply top, right, bottom and left padding to the box mask
  --face-mask-regions FACE_MASK_REGIONS [FACE_MASK_REGIONS ...]
        choose the facial features used for the region mask (choices: skin, left-eyebrow, right-eyebrow, left-eye, right-eye, eye-glasses, nose, mouth, upper-lip, lower-lip)

frame extraction:
  --trim-frame-start TRIM_FRAME_START
        specify the start frame of the target video
  --trim-frame-end TRIM_FRAME_END
        specify the end frame of the target video
  --temp-frame-format {bmp,jpg,png}
        specify the temporary resources format
  --keep-temp
        keep the temporary resources after processing

output creation:
  --output-image-quality [0-100]
        specify the image quality which translates to the compression factor
  --output-image-resolution OUTPUT_IMAGE_RESOLUTION
        specify the image output resolution based on the target image
  --output-video-encoder {libx264,libx265,libvpx-vp9,h264_nvenc,hevc_nvenc,h264_amf,hevc_amf}
        specify the encoder used for the video compression
  --output-video-preset {ultrafast,superfast,veryfast,faster,fast,medium,slow,slower,veryslow}
        balance fast video processing and video file size
  --output-video-quality [0-100]
        specify the video quality which translates to the compression factor
  --output-video-resolution OUTPUT_VIDEO_RESOLUTION
        specify the video output resolution based on the target video
  --output-video-fps OUTPUT_VIDEO_FPS
        specify the video output fps based on the target video
  --skip-audio
        omit the audio from the target video

frame processors:
  --frame-processors FRAME_PROCESSORS [FRAME_PROCESSORS ...]
        load a single or multiple frame processors (choices: face_debugger, face_enhancer, face_swapper, frame_colorizer, frame_enhancer, lip_syncer, ...)
  --face-debugger-items FACE_DEBUGGER_ITEMS [FACE_DEBUGGER_ITEMS ...]
        load a single or multiple face debugger items (choices: bounding-box, face-landmark-5, face-landmark-5/68, face-landmark-68, face-landmark-68/5, face-mask, face-detector-score, face-landmarker-score, age, gender)
  --face-enhancer-model {codeformer,gfpgan_1.2,gfpgan_1.3,gfpgan_1.4,gpen_bfr_256,gpen_bfr_512,gpen_bfr_1024,gpen_bfr_2048,restoreformer_plus_plus}
        choose the model responsible for enhancing the face
  --face-enhancer-blend [0-100]
        blend the enhanced into the previous face
  --face-swapper-model {blendswap_256,inswapper_128,inswapper_128_fp16,simswap_256,simswap_512_unofficial,uniface_256}
        choose the model responsible for swapping the face
  --frame-colorizer-model {ddcolor,ddcolor_artistic,deoldify_artistic}
        choose the model responsible for colorizing the frame
  --frame-colorizer-blend [0-100]
        blend the colorized into the previous frame
  --frame-enhancer-model {lsdir_x4,nomos8k_sc_x4,real_esrgan_x2,real_esrgan_x2_fp16,real_esrgan_x4,real_esrgan_x4_fp16,real_hatgan_x4,span_kendata_x4}
        choose the model responsible for enhancing the frame
  --frame-enhancer-blend [0-100]
        blend the enhanced into the previous frame
  --lip-syncer-model {wav2lip_gan}
        choose the model responsible for syncing the lips

uis:
  --ui-layouts UI_LAYOUTS [UI_LAYOUTS ...]
        launch a single or multiple UI layouts (choices: benchmark, default, webcam, ...)
```
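
As a rough illustration of how the options above combine, the following Python sketch assembles a headless invocation as an argument list. `build_command` and `DOCUMENTED_RANGES` are hypothetical names introduced here and are not part of FaceFusion. The numeric ranges are copied from the usage text, and repeating `--source` once per path is an assumption based on how the usage line is printed. The helper only builds and validates the list, so running it is left to the caller, for example via `subprocess.run(command, check=True)`.

```python
import shlex
from typing import Dict, List, Optional, Sequence, Tuple, Union

Number = Union[int, float]

# Numeric ranges copied from the usage text; this is not an exhaustive list.
DOCUMENTED_RANGES: Dict[str, Tuple[Number, Number]] = {
    "--execution-thread-count": (1, 128),
    "--execution-queue-count": (1, 32),
    "--face-detector-score": (0.0, 1.0),
    "--reference-face-distance": (0.0, 1.5),
    "--output-video-quality": (0, 100),
}


def build_command(
    source_paths: Sequence[str],
    target_path: str,
    output_path: str,
    frame_processors: Sequence[str] = ("face_swapper",),
    numeric_options: Optional[Dict[str, Number]] = None,
) -> List[str]:
    """Assemble an argument list for a headless run (hypothetical helper)."""
    if not source_paths:
        raise ValueError("at least one source path is required")
    command = ["python", "run.py", "--headless"]
    # Assumption: --source is passed once per source path.
    for source_path in source_paths:
        command += ["--source", source_path]
    command += ["--target", target_path, "--output", output_path]
    # --frame-processors takes one or more space separated values.
    if frame_processors:
        command += ["--frame-processors", *frame_processors]
    # Reject values outside the ranges printed in the usage text.
    for flag, value in (numeric_options or {}).items():
        if flag not in DOCUMENTED_RANGES:
            raise ValueError(f"no documented range for {flag}")
        low, high = DOCUMENTED_RANGES[flag]
        if not low <= value <= high:
            raise ValueError(f"{flag} must be within [{low}, {high}], got {value}")
        command += [flag, str(value)]
    return command


if __name__ == "__main__":
    # File names are placeholders for illustration only.
    example = build_command(
        ["source.jpg"],
        "target.mp4",
        "output.mp4",
        frame_processors=["face_swapper", "face_enhancer"],
        numeric_options={"--output-video-quality": 80},
    )
    # Print a copy-pasteable shell command.
    print(shlex.join(example))
```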


Documentation
-------------

Read the [documentation](https://docs.facefusion.io) for a deep dive.