* Replace audio whenever set via source
* use scale_face_landmark_5() in age_modifier
* Fix wording and ordering of options
* Adjust wording for face editor
* Fix wording for processors
* Switch order of frame colorizer options
* That condition is actually not needed
* Simplify UI layout API by removing pre_render()
* Clean args and safe cast ini values (#775)
* Clean args and safe cast ini values
* Clean args and safe cast ini values
* Clean args and safe cast ini values
* Introduce paths group
* Fix job list command and change order
* Add job list testing todo
* Fix spacing in typing
* Fix benchmark by ignoring audio
* Simplify and avoid knowing the provider values (#782)
* Fix logger table with empty value
* Complete Typing
---------
Co-authored-by: harisreedhar <h4harisreedhar.s.s@gmail.com>
* Cleanup after age modifier PR
* Cleanup after age modifier PR
* Use OpenVino 2024.2.0 for installer
* Prepare 3.0.0 for installer
* Fix benchmark suite, Introduce sync_item() for state manager
* Fix lint
* Render slide preview also in lower res
* Lower thread and queue count to avoid false usage
* Fix spacing
* Feat/jobs UI (#627)
* Jobs UI part1
* Change naming
* Jobs UI part2
* Jobs UI part3
* Jobs UI part4
* Jobs UI part4
* Jobs UI part5
* Jobs UI part6
* Jobs UI part7
* Jobs UI part8
* Jobs UI part9
* Jobs UI part10
* Jobs UI part11
* Jobs UI part12
* Fix rebase
* Jobs UI part13
* Jobs UI part14
* Jobs UI part15
* changes (#626)
* Remove useless ui registration
* Remove useless ui registration
* move job_list.py
replace [0] with get_first()
* optimize imports
* fix date None problem
add test job list
* Jobs UI part16
* Jobs UI part17
* Jobs UI part18
* Jobs UI part19
* Jobs UI part20
* Jobs UI part21
* Jobs UI part22
* move job_list_options
* Add label to job status checkbox group
* changes
* changes
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* Update some dependencies
* UI helper to convert 'none'
* validate job (#628)
* changes
* changes
* add test
* changes
* changes
* Minor adjustments
* Replace is_json with is_file
* Handle empty and invalid json in job_list
* Handle empty and invalid json in job_list
* Handle empty and invalid json in job_list
* Work on the job manager UI
* Cosmetic changes on common helper
* Just make it work for now
* Just make it work for now
* Just make it work for now
* Streamline the step index lookups
* Hide footer
* Simplify instant runner
* Simplify instant runner UI and job manager UI
* Fix empty step choices
* Fix empty step choices
* Fix none values in UI
* Rework on benchmark (add warmup) and job list
* Improve ValueAndUnit
* Add step 1 of x output
* Cosmetic changes on the UI
* Fix invalid job file names
* Update preview
* Introducing has_step() and sorting out insert behaviour
* Introducing has_step() and sorting out insert behaviour
* Add [ none ] to some job id dropdowns
* Make updated dropdown values kinda perfect
* Make updated dropdown values kinda perfect
* Fix testing
* Minor improvement on UI
* Fix false config lookup
* Remove TensorRT as our models are not made for it
* Feat/cli commands second try rev2 (#640)
* Refactor CLI to commands
* Refactor CLI to commands part2
* Refactor CLI to commands part3
* Refactor CLI to commands part4
* Rename everything to facefusion.py
* Refactor CLI to commands part5
* Refactor CLI to commands part6
* Adjust testing
* Fix lint
* Fix lint
* Fix lint
* Refactor CLI to commands part7
* Extend State typing
* Fix false config lookup, adjust logical orders
* Move away from passing program part1
* Move away from passing program part2
* Move away from passing program part3
* Fix lint
* Move away from passing program part4
* ui-args update
* ui-args update
* ui-args update
* temporary type fix
* Move away from passing program part5
* remove unused
* creates args.py
* Move away from passing program part6
* Move away from passing program part7
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* Minor optimizations
* Update commands in README
* Fix job-retry command
* Fix multi runs via UI
* add more job keys
* Cleanup codebase
* One method to create inference session (#641)
* One method to create inference session
* Remove warnings, as there are none
* Remember job id during processing
* Fix face masker config block
* Change wording
* Prevent age modifier from using CoreML
* add expression restorer (#642)
* add expression restorer
* fix import
* fix lint
* changes
* changes
* changes
* Host the final model for expression restorer
* Insert step on the given index
* UI workover (#644)
* UI workover part1
* Introduce ComponentOptions
* Only set Media components to None when visibility changes
* Clear static faces and reference faces between step processing
* Minor changes
* Minor changes
* Fix testing
* Enable test_sanitize_path_for_windows (#646)
* Dynamic download during job processing (#647)
* Fix face masker UI
* Rename run-headless to headless-run
* Feat/split frame processor UI (#649)
* Split frame processor UI
* Split frame processor UI part3, Refactor get_model_initializer
* Split frame processor UI part4
* Feat/rename frame processors (#651)
* Rename frame processors
* Rename frame processors part2
* Fix imports
Conflicts:
facefusion/uis/layouts/benchmark.py
facefusion/uis/layouts/default.py
* Fix imports
* Cosmetic changes
* Fix multi threading for ROCm
* Change temp frames pattern
* Adjust terminal help
* remove expression restorer (#653)
* Expression restorer as processor (#655)
* add expression restorer
* changes
* Cleanup code
* Add TensorRT support back
* Add TensorRT support back
* Add TensorRT support back
* changes (#656)
* Change minor wording
* Fix face enhancer slider
* Add more typing
* Fix expression-restorer when using trim (#659)
* changes
* changes
* Rework/model and inference pool part2 (#660)
* Rework on model and inference pool
* Introduce inference sources and pools part1
* Introduce inference sources and pools part2
* Introduce inference sources and pools part3
* Introduce inference sources and pools part4
* Introduce inference sources and pools part5
* Introduce inference sources and pools part6
* Introduce inference sources and pools part6
* Introduce inference sources and pools part6
* Introduce inference sources and pools part7
* Introduce inference sources and pools part7
* Introduce inference sources and pools part8
* Introduce inference sources and pools part9
* Introduce inference sources and pools part10
* Introduce inference sources and pools part11
* Introduce inference sources and pools part11
* Introduce inference sources and pools part11
* Introduce inference sources and pools part12
* Reorganize the face masker UI
* Fix trim in UI
* Feat/hashed sources (#668)
* Introduce source helper
* Remove post_check() and just use process_manager
* Remove post_check() part2
* Add hash based downloads
* Add hash based downloads part2
* Add hash based downloads part3
* Add hash based downloads part4
* Add hash based downloads part5
* Add hash based downloads part6
* Add hash based downloads part7
* Add hash based downloads part7
* Add hash based downloads part8
* Remove print
* Prepare 3.0.0 release
* Fix UI
* Release the check when really done
* Update inputs for live portrait
* Update to 3.0.0 releases, extend download postfix
* Move files to the right place
* Logging for the hash and source validation
* Changing logic to handle corrupt sources
* Fix typo
* Use names over get_inputs(), Remove set_options() call
* Age modifier now works for CoreML too
* Update age_modifier.py
* Add video encoder h264_videotoolbox and hevc_videotoolbox
* Face editor add eye gaze & remove open factor sliders (#670)
* changes
* add eye gaze
* changes
* cleanup
* add eyebrow control
* changes
* changes
* Feat/terminal UI (#671)
* Introduce terminal to the UI
* Introduce terminal to the UI part2
* Introduce terminal to the UI part2
* Introduce terminal to the UI part2
* Calc range step to avoid weird values
* Use Sequence for ranges
* Use Sequence for ranges
* changes (#673)
* Use Sequence for ranges
* Finalize terminal UI
* Finalize terminal UI
* Webcam cosmetics, Fix normalize fps to accept int
* Cosmetic changes
* Finalize terminal UI
* Rename leftover typings
* Fix wording
* Fix rounding in metavar
* Fix rounding in metavar
* Rename to face classifier
* Face editor lip moves (#677)
* changes
* changes
* changes
* Fix rounding in metavar
* Rename to face classifier
* changes
* changes
* update naming
---------
Co-authored-by: henryruhs <info@henryruhs.com>
* Fix wording
* Feat/many landmarker + face analyser breakdown (#678)
* Basic multi landmarker integration
* Simplify some method names
* Break into face_detector and face_landmarker
* Fix cosmetics
* Fix testing
* Break into face_attributor and face_recognizer
* Clear them all
* Clear them all
* Rename to face classifier
* Rename to face classifier
* Fix testing
* Fix stuff
* Add face landmarker model to UI
* Add face landmarker model to UI part2
* Split the config
* Split the UI
* Improvement from code review
* Improvement from code review
* Validate args also for sub parsers
* Remove clear of processors in process step
* Allow finder control for the face editor
* Fix lint
* Improve testing performance
* Remove unused file, Clear processors from the UI before job runs
* Update the installer
* Uniform set handler for swapper and detector in the UI
* Fix example urls
* Feat/inference manager (#684)
* Introduce inference manager
* Migrate all to inference manager
* clean ini
* Introduce app context based inference pools
* Fix lint
* Fix typing
* Adjust layout
* Less border radius
* Rename app context names
* Fix/live portrait directml (#691)
* changes (#690)
* Adjust naming
* Use our assets release
* Adjust naming
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* Add caches to gitignore
* Update dependencies and drop CUDA 11.8 support (#693)
* Update dependencies and drop CUDA 11.8 support
* Play safe and keep numpy 1.x.x
* Improve TensorRT optimization
* changes
* changes
* changes
* changes
* changes
* changes
* changes
* changes
* changes
* Reuse inference sessions (#696)
* Fix force-download command
* Refactor processors to forward() (#698)
* Install tensorrt when selecting cuda
* Minor changes
* Use latest numpy
* Fix limit system memory
* Implement forward() for every inference (#699)
* Implement forward() for every inference
* Implement forward() for every inference
* Implement forward() for every inference
* Implement forward() for every inference
* changes
* changes
* changes
* changes
* Feat/fairface (#710)
* Replace gender_age model with fair face (#709)
* changes
* changes
* changes
* age dropdown to range-slider
* Cleanup code
* Cleanup code
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* Extend installer to set library paths for cuda and tensorrt (#707)
* Extend installer to set library paths for cuda and tensorrt
* Add refresh of conda env
* Remove invalid commands
* Set the conda env according to operating system
* Update for ROCm 6.2
* fix installer
* Update installer.py
* Add missing face selector keys
* Try to keep original LD_LIBRARY_PATH
* windows support installer
* Final touch to the installer
* Remove spaces
* Simplify collect_model_downloads()
* Fix force download for once and forever
* Housekeeping (#715)
* changes
* changes
* changes
* Fix performance part1
* Fix mixed states (#689)
* Fix mixed states
* Add missing sync for job args
* Move UnionStateXXX to base typing
* Undo
* Remove UnionStateXXX
* Fix app context performance lookup (#717)
* Restore performance for inswapper
* Move upper() to the logger
* Undo debugging
* Move TensorRT installation to docs
* Sort out log level typing, Add log level UI dropdown (#719)
* Fix inference pool part1
* Validate conda library paths existence
* Default face selector order to large-small
* Fix inference pool context according to execution provider (#720)
* Fix app context under Windows
* CUDA and TensorRT update for the installer
* Remove concept of static processor modules
* Revert false commit
* Change event order makes a difference
* Fix multi model context in inference pool (#721)
* Fix multi model context in inference pool
* Fix multi model context in inference pool part2
* Use latest gradio to avoid fastapi bug
* Rework on the Windows Installer
* Use embedding converter (#724)
* changes (#723)
* Upload models to official assets repo
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* Rework on the Windows Installer part2
* Resolve subprocess calls (#726)
* Experiment
* Resolve subprocess calls to cover edge cases like broken PATH
* Adjust wording
* Simplify code
* Rework on the Windows Installer part3
* Rework on the Windows Installer part4
* Numpy fix for older onnxruntime
* changes (#729)
* Add space
* Add MacOS installer
* Use favicon
* Fix disabled logger
* Layout polishing (#731)
* Update dependencies, Adjust many face landmarker logic
* Cosmetics changes
* Should be button
* Introduce randomized action button
* Fix update of lip syncer and expression restorer
* Stop sharing inference session this prevents flushing VRAM
* Fix test
* Fix urls
* Prepare release
* Vanish inquirer
* Sticky preview does not work on portrait images
* Sticky preview only for landscape images and videos
* remove gradio tunnel env
* Change wording and deeplinks
* increase peppa landmark score offset
* Change wording
* Graceful exit install.py
* Just adding a required
* Cannot use the exit_helper
* Rename our model
* Change color of face-landmark-68/5
* Limit liveportrait (#739)
* changes
* changes
* changes
* Cleanup
* Cleanup
---------
Co-authored-by: harisreedhar <h4harisreedhar.s.s@gmail.com>
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* limit expression restorer
* change expression restorer 0-100 range
* Use 256x icon
* changes
* changes
* changes
* changes
* Limit face editor rotation (#745)
* changes (#743)
* Finish euler methods
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* Use different coveralls badge
* Move about wording
* Shorten scope in the logger
* changes
* changes
* Shorten scope in the logger
* fix typo
* Simplify the arcface converter names
* Update preview
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
Co-authored-by: harisreedhar <h4harisreedhar.s.s@gmail.com>
* Modernize CI
* Modernize CI
* Modernize CI
* Implement dynamic config (#518)
* Implement dynamic config
* Fix apply config
* Move config to general
* Move config to general
* Move config to general
* Add Windows installer
* Add --open-browser
* Add Windows installer part2
* Use non-commercial license for the installer
* Fix create environment in installer
* Fix openvino for installer
* Fix conda for installer
* Fix conda for installer, Remove python and pip as it is part of conda
* Improve installer - guess the path
* Fix CI
* Add missing accept-source-agreements to installer
* Install WinGet
* Improve WinGet installation steps
* Use absolute path for winget
* More installer polishing
* Add final page to installer, disable version check for Gradio
* Remove finish page again
* Use NEXT for metadata
* Support for /S mode
* Use winget-less approach
* Improve Conda uninstall
* Improve code using platform helpers (#529)
* Update dependencies
* Feat/fix windows unicode paths (#531)
* Fix the Windows unicode path dilemma
* Update dependencies
* Fix the Windows unicode path dilemma part2
* Remove conda environment on uninstall
* Fix uninstall command
* Install apps for local user only
* Add ultra sharp
* Add clear reality
* Update README and FUNDING
* Update FUNDING.yml
* Prevent preview of large videos in Gradio (#540)
* Fix order
* Refactor temporary file management, Use temporary file for image processing (#542)
* Allow webm on target component
* Reduce mosaic effect for frame processors
* clear static faces on trim frame changes
* Fix trim frame component
* Downgrade openvino dependency
* Prepare next release
* Move get_short_path to filesystem, Add/Improve some testing
* Prepare installer, Prevent infinite loop for sanitize_path_for_windows
* Introduce execution device id
* Introduce execution device id
* Seems like device id can be a string
* Seems like device id can be a string
* Make Intel Arc work with OpenVINOExecution
* Use latest Git
* Update wording
* Fix create_float_range
* Update preview
* Fix Git link
* Validate the overrides from facefusion.ini
* Break down cli testing
* Remove architecture lookup to support old driver
* Remove architecture lookup to support old driver
* Remove hwaccel auto
* Respect the output video resolution
* Bump next version
* Full directml support (#501)
* Introduce conditional thread management for DML support
* Finish migration to thread helpers
* Introduce dynamic frame colorizer sizes
* Introduce dynamic frame colorizer sizes
* Add 192x192 to frame colorizer
* Fix async audio
* Add real_hatgan_x4 model
* Mark it as NEXT
* Force download to be executed and exit
* Fix frame per second interpolation
* 5 to 68 landmark (#456)
* changes
* changes
* Adjust model url
* Cleanup 5 to 68 landmark conversion
* Move everything to face analyser
* Introduce matrix only face helper
* Revert facefusion.ini
* Adjust limit due to false positive analysis
* changes (#457)
* Use pixel format yuv422p to merge video
* Fix some code
* Minor cleanup
* Add gpen_bfr_1024 and gpen_bfr_2048
* Revert it back to yuv420p due to compatibility issues
* Add debug back to ffmpeg
* Add debug back to ffmpeg
* Migrate to conda (#461)
* Migrate from venv to conda
* Migrate from venv to conda
* Message when conda is not activated
* Use release for every slider (#463)
* Use release event handler for every slider
* Move more sliders to release handler
* Move more sliders to release handler
* Add get_ui_components() to simplify code
* Revert some changes on frame slider
* Add the first iteration of a frame colorizer
* Support for the DDColor model
* Improve model file handling
* Improve model file handling part2
* Remove deoldify
* Remove deoldify
* Voice separator (#468)
* changes
* changes
* changes
* changes
* changes
* changes
* Rename audio extractor to voice extractor
* Cosmetic changes
* Cosmetic changes
* Fix fps lowering and boosting
* Fix fps lowering and boosting
* Fix fps lowering and boosting
* Some refactoring for audio.py and some astype() here and there (#470)
* Some refactoring for audio.py and some astype() here and there
* Fix lint
* Spacing
* Add mp3 to benchmark suite for lip syncer testing
* Improve naming
* Adjust chunk size
* Use higher quality
* Revert "Use higher quality"
This reverts commit d32f287572.
* Improve naming in ffmpeg.py
* Simplify code
* Better fps calculation
* Fix naming here and there
* Add back real esrgan x2
* Remove trailing comma
* Update wording and README
* Use semaphore to prevent frame colorizer memory issues
* Revert "Remove deoldify"
This reverts commit bd8034cbc7.
* Remove unused type from frame colorizer
* Adjust naming
* Add missing clear of model initializer
* Change nvenc preset mapping to support old FFMPEG 4
* Update onnxruntime to 1.17.1
* Fix lint
* Prepare 2.5.0
* Fix Gradio overrides
* Add Deoldify Artistic back
* Feat/audio refactoring (#476)
* Improve audio naming and variables
* Improve audio naming and variables
* Refactor voice extractor like crazy
* Refactor voice extractor like crazy
* Remove spaces
* Update the usage
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* Update official url for cuda 12-2 wheels
* Fix preview for audio to image
* Prevent download loop when remote is unreachable
* Prevent download loop when remote is unreachable
* changes (#444)
* Tidy up monkey patch
* Use cpu core count for concurrency count
* Dynamic concurrency_count for ideal Gradio performance
* Conditional download face analyser models
* Fix testing via pre_check()
* Introduce checking to process manager for blocking the UI
* Introduce checking to process manager for blocking the UI
* Introduce checking to process manager for blocking the UI
* Introduce checking to process manager for blocking the UI
* Move the blocking while model download to the correct position
* Remove unused imports
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* Rename landmark 5 variables
* Mark as NEXT
* Render tabs for multiple ui layout usage
* Allow many face detectors at once, Add face detector tweaks
* Remove face detector tweaks for now (kinda placebo)
* Fix lint issues
* Allow rendering the landmark-5 and landmark-5/68 via debugger
* Fix naming
* Convert face landmark based on confidence score
* Convert face landmark based on confidence score
* Add scrfd face detector model (#397)
* Add scrfd face detector model
* Switch to scrfd_2.5g.onnx model
* Just some renaming
* Downgrade OpenCV, Add SYSTEM_VERSION_COMPAT=0 for MacOS
* Improve naming
* prepare detect frame outside of semaphore
* Feat/process manager (#399)
* Minor naming
* Introduce process manager to start and stop
* Introduce process manager to start and stop
* Introduce process manager to start and stop
* Introduce process manager to start and stop
* Introduce process manager to start and stop
* Remove useless test for now
* Avoid useless variables
* Show stop once is_processing is True
* Allow to stop ffmpeg processing too
* Implement output image resolution (#403)
* Implement output image resolution
* Reorder code
* Simplify output logic and therefore fix bug
* Frame-enhancer-onnx (#404)
* changes
* changes
* changes
* changes
* add models
* update workflow
* Some cleanup
* Some cleanup
* Feat/frame enhancer polishing (#410)
* Some cleanup
* Polish the frame enhancer
* Frame Enhancer: Add more models, optimize processing
* Minor changes
* Improve readability of create_tile_frames and merge_tile_frames
* We don't have enough models yet
* Feat/face landmarker score (#413)
* Introduce face landmarker score
* Fix testing
* Fix testing
* Use release for score related sliders
* Reduce face landmark fallbacks
* Scores and landmarks in Face dict, Change color-theme in face debugger
* Scores and landmarks in Face dict, Change color-theme in face debugger
* Fix some naming
* Add 8K support (for whatever reasons)
* Fix testing
* Using get() for face.landmarks
* Introduce statistics
* More statistics
* Limit the histogram equalization
* Enable queue() for default layout
* Improve copy_image()
* Fix error when switching detector model
* Always set UI values with globals if possible
* Use different logic for output image and output video resolutions
* Enforce re-download if file size is off
* Remove unused method
* Remove unused method
* Remove unused warning filter
* Improved output path normalization (#419)
* Handle some exceptions
* Handle some exceptions
* Cleanup
* Prevent countless thread locks
* Listen to user feedback
* Fix webp edge case
* Feat/cuda device detection (#424)
* Introduce cuda device detection
* Introduce cuda device detection
* it's gtx
* Move logic to run_nvidia_smi()
* Finalize execution device naming
* Finalize execution device naming
* Merge execution_helper.py to execution.py
* Undo lowercase of values
* Undo lowercase of values
* Finalize naming
* Add missing entry to ini
* fix lip_syncer preview (#426)
* fix lip_syncer preview
* change
* Refresh preview on trim changes
* Cleanup frame enhancers and remove useless scale in merge_video() (#428)
* Keep lips over the whole video once lip syncer is enabled (#430)
* Keep lips over the whole video once lip syncer is enabled
* changes
* changes
* Fix spacing
* Use empty audio frame on silence
* Use empty audio frame on silence
* Fix ConfigParser encoding (#431)
facefusion.ini is UTF-8 encoded, but config.py doesn't specify an encoding, which results in corrupted entries when non-English characters are used.
Affected entries:
source_paths
target_path
output_path
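A minimal sketch of this kind of fix, assuming the standard library `configparser` is used to read the file. The helper names `read_config` and `write_sample_config`, the `general` section, and the sample path are hypothetical and only serve the illustration; they are not taken from config.py.

```python
import configparser
import os
import tempfile


def read_config(config_path: str) -> configparser.ConfigParser:
    # Hypothetical helper for illustration; not the project's actual config.py.
    config = configparser.ConfigParser()
    # Passing the encoding explicitly avoids falling back to the platform default.
    config.read(config_path, encoding='utf-8')
    return config


def write_sample_config(directory_path: str) -> str:
    # Write a small UTF-8 ini file containing a non-ASCII character.
    config_path = os.path.join(directory_path, 'sample.ini')
    with open(config_path, 'w', encoding='utf-8') as config_file:
        config_file.write('[general]\noutput_path = /tmp/caf\u00e9.mp4\n')
    return config_path


with tempfile.TemporaryDirectory() as temp_directory:
    sample_config = read_config(write_sample_config(temp_directory))
    output_path = sample_config.get('general', 'output_path')
```

Without the explicit `encoding` argument, `read()` opens the file with the locale's preferred encoding, which on some Windows setups is not UTF-8.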
* Adjust spacing
* Improve the GTX 16 series detection
* Use general exception to catch ParseError
* Use general exception to catch ParseError
* Host frame enhancer models4
* Use latest onnxruntime
* Minor changes in benchmark UI
* Different approach to cancel ffmpeg process
* Add support for amd amf encoders (#433)
* Add amd_amf encoders
* remove -rc cqp from amf encoder parameters
* Improve terminal output, move success messages to debug mode
* Improve terminal output, move success messages to debug mode
* Minor update
* Minor update
* onnxruntime 1.17.1 matches cuda 12.2
* Feat/improved scaling (#435)
* Prevent useless temp upscaling, Show resolution and fps in terminal output
* Remove temp frame quality
* Remove temp frame quality
* Tiny cleanup
* Default back to png for temp frames, Remove pix_fmt from frame extraction due to mjpeg error
* Fix inswapper fallback by onnxruntime
* Fix inswapper fallback by major onnxruntime
* Fix inswapper fallback by major onnxruntime
* Add testing for vision restrict methods
* Fix left / right face mask regions, add left-ear and right-ear
* Flip right and left again
* Undo ears - does not work with box mask
* Prepare next release
* Fix spacing
* 100% quality when using jpg for temp frames
* Use span_kendata_x4 as default because of its speed
* benchmark optimal tile and pad
* Undo commented out code
* Add real_esrgan_x4_fp16 model
* Be strict when using many face detectors
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
Co-authored-by: aldemoth <159712934+aldemoth@users.noreply.github.com>
* feat/yoloface (#334)
* added yolov8 to face_detector (#323)
* added yolov8 to face_detector
* added yolov8 to face_detector
* Initial cleanup and renaming
* Update README
* refactored detect_with_yoloface (#329)
* refactored detect_with_yoloface
* apply review
* Change order again
* Restore working code
* modified code (#330)
* refactored detect_with_yoloface
* apply review
* use temp_frame in detect_with_yoloface
* reorder
* modified
* reorder models
* Tiny cleanup
---------
Co-authored-by: tamoharu <133945583+tamoharu@users.noreply.github.com>
* include audio file functions (#336)
* Add testing for audio handlers
* Change order
* Fix naming
* Use correct typing in choices
* Update help message for arguments, Notation based wording approach (#347)
* Update help message for arguments, Notation based wording approach
* Fix installer
* Audio functions (#345)
* Update ffmpeg.py
* Create audio.py
* Update ffmpeg.py
* Update audio.py
* Update audio.py
* Update typing.py
* Update ffmpeg.py
* Update audio.py
* Rename Frame to VisionFrame (#346)
* Minor tidy up
* Introduce audio testing
* Add more todo for testing
* Add more todo for testing
* Fix indent
* Enable venv on the fly
* Enable venv on the fly
* Revert venv on the fly
* Revert venv on the fly
* Force Gradio to shut up
* Force Gradio to shut up
* Clear temp before processing
* Reduce terminal output
* include audio file functions
* Enforce output resolution on merge video
* Minor cleanups
* Add age and gender to face debugger items (#353)
* Add age and gender to face debugger items
* Rename like suggested in the code review
* Fix the output framerate vs. time
* Lip Sync (#356)
* Cli implementation of wav2lip
* - create get_first_item()
- remove non gan wav2lip model
- implement video memory strategy
- implement get_reference_frame()
- implement process_image()
- rearrange crop_mask_list
- implement test_cli
* Simplify testing
* Rename to lip syncer
* Fix testing
* Fix testing
* Minor cleanup
* Cuda 12 installer (#362)
* Make cuda nightly (12) the default
* Better keep legacy cuda just in case
* Use CUDA and ROCM versions
* Remove MacOS options from installer (CoreML included in default package)
* Add lip-syncer support to source component
* Add lip-syncer support to source component
* Fix the check in the source component
* Add target image check
* Introduce more helpers to suit the lip-syncer needs
* Downgrade onnxruntime as of buggy 1.17.0 release
* Revert "Downgrade onnxruntime as of buggy 1.17.0 release"
This reverts commit f4a7ae6824.
* More testing and add todos
* Fix the frame processor API to at least not throw errors
* Introduce dict based frame processor inputs (#364)
* Introduce dict based frame processor inputs
* Forgot to adjust webcam
* create path payloads (#365)
* create index payload to paths for process_frames
* rename to payload_paths
* This code now is poetry
* Fix the terminal output
* Make lip-syncer work in the preview
* Remove face debugger test for now
* Reorder reference_faces, Fix testing
* Use inswapper_128 on buggy onnxruntime 1.17.0
* Undo inswapper_128_fp16 due to broken onnxruntime 1.17.0
* Undo inswapper_128_fp16 due to broken onnxruntime 1.17.0
* Fix lip_syncer occluder & region mask issue
* Fix preview once in case there was no output video fps
* fix lip_syncer custom fps
* remove unused import
* Add 68 landmark functions (#367)
* Add 68 landmark model
* Add landmark to face object
* Re-arrange and modify typing
* Rename function
* Rearrange
* Rearrange
* ignore type
* ignore type
* change type
* ignore
* name
* Some cleanup
* Some cleanup
* Oops, I broke something
* Feat/face analyser refactoring (#369)
* Restructure face analyser and start TDD
* YoloFace and Yunet testing are passing
* Remove offset from yoloface detection
* Cleanup code
* Tiny fix
* Fix get_many_faces()
* Tiny fix (again)
* Use 320x320 fallback for retinaface
* Fix merging mashup
* Upload wav2lip model
* Upload 2dfan2 model and rename internal to face_predictor
* Downgrade onnxruntime for most cases
* Update for the face debugger to render landmark 68
* Try to make detect_face_landmark_68() and detect_gender_age() more uniform
* Enable retinaface testing for 320x320
* Make detect_face_landmark_68() and detect_gender_age() as uniform as … (#370)
* Make detect_face_landmark_68() and detect_gender_age() as uniform as possible
* Revert landmark scale and translation
* Make box-mask for lip-syncer adjustable
* Add create_bbox_from_landmark()
* Remove currently unused code
* Feat/uniface (#375)
* add uniface (#373)
* Finalize UniFace implementation
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* My approach on how to do it
* edit
* edit
* replace vertical blur with gaussian
* remove region mask
* Rebase against next and restore method
* Minor improvements
* Minor improvements
* rename & add forehead padding
* Adjust and host uniface model
* Use 2dfan4 model
* Rename to face landmarker
* Feat/replace bbox with bounding box (#380)
* Add landmark 68 to 5 conversion
* Add landmark 68 to 5 conversion
* Keep 5, 5/68 and 68 landmarks
* Replace kps with landmark
* Replace bbox with bounding box
* Reshape face_landmark5_list differently
* Make yoloface the default
* Move convert_face_landmark_68_to_5 to face_helper
* Minor spacing issue
* Dynamic detector sizes according to model (#382)
* Dynamic detector sizes according to model
* Dynamic detector sizes according to model
* Undo false committed files
* Add lip syncer model to the UI
* fix halo (#383)
* Bump to 2.3.0
* Update README and wording
* Update README and wording
* Fix spacing
* Apply _vision suffix
* Apply _vision suffix
* Apply _vision suffix
* Apply _vision suffix
* Apply _vision suffix
* Apply _vision suffix
* Apply _vision suffix, Move mouth mask to face_masker.py
* Apply _vision suffix
* Apply _vision suffix
* increase forehead padding
---------
Co-authored-by: tamoharu <133945583+tamoharu@users.noreply.github.com>
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* renaming and restructuring (#282)
* Renaming and restructuring
* Renaming and restructuring
* Renaming and restructuring
* Fix gender detection
* Implement distance to face debugger
* Implement distance to face debugger part2
* Implement distance to face debugger part3
* Mark as next
* Fix reference when face_debugger comes first
* Use official onnxruntime nightly
* CUDA on steroids
* CUDA on steroids
* Add some testing
* Set inswapper_128_fp16 as default
* Feat/block until post check (#292)
* Block until download is done
* Introduce post_check()
* Fix webcam
* Update dependencies
* Add --force-reinstall to installer
* Introduce config ini (#298)
* Introduce config ini
* Fix output video encoder
* Revert help listings back to commas, Move SSL hack to download.py
* Introduce output-video-preset which defaults to veryfast
* Mapping for nvenc encoders
* Rework on events and non-blocking UI
* Add fast bmp to temp_frame_formats
* Add fast bmp to temp_frame_formats
* Show total processing time on success
* Show total processing time on success
* Show total processing time on success
* Move are_images, is_image and is_video back to filesystem
* Fix some spacings
* Pissing everyone off by renaming stuff
* Fix seconds output
* feat/video output fps (#312)
* added output fps slider, removed 'keep fps' option (#311)
* added output fps slider, removed 'keep fps' option
* now uses passed fps instead of global fps for ffmpeg
* fps values are now floats instead of ints
* fix previous commit
* removed default value from fps slider
this is so we can implement a dynamic default value later
* Fix seconds output
* Some cleanup
---------
Co-authored-by: Ran Shaashua <47498956+ranshaa05@users.noreply.github.com>
* Allow 0.01 steps for fps
* Make fps unregulated
* Make fps unregulated
* Remove distance from face debugger again (does not work)
* Fix gender age
* Fix gender age
* Hotfix benchmark suite
* Warp face normalize (#313)
* use normalized kp templates
* Update face_helper.py
* My 50 cents to warp_face()
---------
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>
* face-swapper-weight (#315)
* Move prepare_crop_frame and normalize_crop_frame out of apply_swap
* Fix UI bug with different range
* feat/output video resolution (#316)
* Introduce detect_video_resolution, Rename detect_fps to detect_video_fps
* Add calc_video_resolution_range
* Make output resolution work, does not auto-select yet
* Make output resolution work, does not auto-select yet
* Try to keep the origin resolution
* Split code into more fragments
* Add pack/unpack resolution
* Move video_template_sizes to choices
* Improve create_video_resolutions
* Reword benchmark suite
* Optimal speed for benchmark
* Introduce different video memory strategies, rename max_memory to max… (#317)
* Introduce different video memory strategies, rename max_memory to max_system_memory
* Update readme
* Fix limit_system_memory call
* Apply video_memory_strategy to face debugger
* Limit face swapper weight to 3.0
* Remove face swapper weight due to bad render outputs
* Show/hide logic for output video preset
* fix uint8 conversion
* Fix whitespace
* Finalize layout and update preview
* Fix multi renders on face debugger
* Restore less restrictive rendering of preview and stream
* Fix block mode for model downloads
* Add testing
* Cosmetic changes
* Enforce valid fps and resolution via CLI
* Empty config
* Cosmetics on args processing
* Memory workover (#319)
* Cosmetics on args processing
* Fix for MacOS
* Rename all max_ to _limit
* More fixes
* Update preview
* Fix whitespace
---------
Co-authored-by: Ran Shaashua <47498956+ranshaa05@users.noreply.github.com>
Co-authored-by: Harisreedhar <46858047+harisreedhar@users.noreply.github.com>