Comments (4)

AppStolz commented on June 13, 2024

Which option is better for this, please?

usage: inference.py [-h] [--driven_audio DRIVEN_AUDIO] [--source_image SOURCE_IMAGE]
                    [--ref_eyeblink REF_EYEBLINK] [--ref_pose REF_POSE]
                    [--checkpoint_dir CHECKPOINT_DIR] [--result_dir RESULT_DIR]
                    [--pose_style POSE_STYLE] [--batch_size BATCH_SIZE] [--size SIZE]
                    [--expression_scale EXPRESSION_SCALE] [--input_yaw INPUT_YAW [INPUT_YAW ...]]
                    [--input_pitch INPUT_PITCH [INPUT_PITCH ...]]
                    [--input_roll INPUT_ROLL [INPUT_ROLL ...]] [--enhancer ENHANCER]
                    [--background_enhancer BACKGROUND_ENHANCER] [--cpu] [--face3dvis] [--still]
                    [--preprocess {crop,extcrop,resize,full,extfull}] [--verbose] [--old_version]
                    [--net_recon {resnet18,resnet34,resnet50}] [--init_path INIT_PATH]
                    [--use_last_fc USE_LAST_FC] [--bfm_folder BFM_FOLDER] [--bfm_model BFM_MODEL]
                    [--focal FOCAL] [--center CENTER] [--camera_d CAMERA_D] [--z_near Z_NEAR]
                    [--z_far Z_FAR]

Option 1: Using --size for fixed sizes

img = 'examples/source_image/{}.png'.format(default_head_name.value)
!python3.8 inference.py --driven_audio ./examples/driven_audio/audio_profile.mp3 \
           --source_image {img} \
           --result_dir ./results --still --preprocess full --enhancer gfpgan --size 1080,1920

Option 2: Using --preprocess resize for aspect ratio adjustment

img = 'examples/source_image/{}.png'.format(default_head_name.value)
!python3.8 inference.py --driven_audio ./examples/driven_audio/audio_profile.mp3 \
           --source_image {img} \
           --result_dir ./results --still --preprocess resize --enhancer gfpgan

from sadtalker.

salimbenfarhat commented on June 13, 2024

Which option is better for this, please?

usage: inference.py [-h] [--driven_audio DRIVEN_AUDIO] [--source_image SOURCE_IMAGE]
                    [--ref_eyeblink REF_EYEBLINK] [--ref_pose REF_POSE]
                    [--checkpoint_dir CHECKPOINT_DIR] [--result_dir RESULT_DIR]
                    [--pose_style POSE_STYLE] [--batch_size BATCH_SIZE] [--size SIZE]
                    [--expression_scale EXPRESSION_SCALE] [--input_yaw INPUT_YAW [INPUT_YAW ...]]
                    [--input_pitch INPUT_PITCH [INPUT_PITCH ...]]
                    [--input_roll INPUT_ROLL [INPUT_ROLL ...]] [--enhancer ENHANCER]
                    [--background_enhancer BACKGROUND_ENHANCER] [--cpu] [--face3dvis] [--still]
                    [--preprocess {crop,extcrop,resize,full,extfull}] [--verbose] [--old_version]
                    [--net_recon {resnet18,resnet34,resnet50}] [--init_path INIT_PATH]
                    [--use_last_fc USE_LAST_FC] [--bfm_folder BFM_FOLDER] [--bfm_model BFM_MODEL]
                    [--focal FOCAL] [--center CENTER] [--camera_d CAMERA_D] [--z_near Z_NEAR]
                    [--z_far Z_FAR]

Zeelyne commented on June 13, 2024

Hello, I am confused by this problem too. I am working with an image size of 2000*2000. When I added --size 2000 2000 at the end of my command (Option 1), I got the error: unrecognized arguments: 2000. Then I passed just --size 2000, and the error changed to: FileNotFoundError: No such file or directory: "./checkpoints\SadTalker_V0.0.2_2000.safetensors". Do you know how to solve this issue? Many thanks.

AppStolz commented on June 13, 2024

Hey @Zeelyne,

I noticed that you're running into problems with the SadTalker inference script. Both errors come from how --size is interpreted:

Fix for Unrecognized Arguments and FileNotFoundError

  1. Size Argument Format:
    --size takes a single integer. With --size 2000 2000, the second 2000 is left over, which is what "unrecognized arguments: 2000" is reporting. A comma-separated value such as --size 2000,2000 is not an integer either, so it will not work.

  2. Checkpoint File:
    The value of --size is used to build the checkpoint filename, which is why --size 2000 looks for SadTalker_V0.0.2_2000.safetensors in ./checkpoints. As far as I know, the project only provides 256 and 512 checkpoints, so --size selects the face-render model resolution and cannot be set to the dimensions of your source image.

Updated Command

Here's the updated command, assuming SadTalker_V0.0.2_512.safetensors has been downloaded into ./checkpoints:

img = 'examples/source_image/{}.png'.format(default_head_name.value)
!python3.8 inference.py --driven_audio ./examples/driven_audio/audio_profile.mp3 \
           --source_image {img} \
           --result_dir ./results --still --preprocess full --enhancer gfpgan --size 512
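
For anyone who wants to see where the two errors reported above come from, here is a minimal sketch. It is not SadTalker's actual code: build_parser, checkpoint_path and available_sizes are hypothetical helpers, and the only assumptions are that --size is a single integer option and that the checkpoint name follows the pattern shown in the FileNotFoundError message.

```python
import argparse
import os


def build_parser():
    # Hypothetical stand-in for the real parser: a single integer --size option.
    parser = argparse.ArgumentParser(prog="inference.py")
    parser.add_argument("--size", type=int, default=256)
    return parser


def checkpoint_path(checkpoint_dir, size):
    # Assumed naming scheme, taken from the FileNotFoundError message in this thread.
    return os.path.join(checkpoint_dir, "SadTalker_V0.0.2_{}.safetensors".format(size))


def available_sizes(checkpoint_dir):
    # List the sizes for which a matching checkpoint file is actually present.
    prefix, suffix = "SadTalker_V0.0.2_", ".safetensors"
    sizes = []
    if os.path.isdir(checkpoint_dir):
        for name in os.listdir(checkpoint_dir):
            if name.startswith(prefix) and name.endswith(suffix):
                middle = name[len(prefix):-len(suffix)]
                if middle.isdigit():
                    sizes.append(int(middle))
    return sorted(sizes)


if __name__ == "__main__":
    parser = build_parser()
    # parse_known_args returns the leftover tokens instead of exiting,
    # which makes the "unrecognized arguments" case visible.
    args, leftover = parser.parse_known_args(["--size", "2000", "2000"])
    print("size:", args.size, "leftover:", leftover)
    print("looks for:", checkpoint_path("./checkpoints", args.size))
    print("sizes found on disk:", available_sizes("./checkpoints"))
```

Running available_sizes on your own ./checkpoints folder before starting inference is a quick way to check which --size values can work on your machine.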
