Coder Social home page Coder Social logo

Comments (10)

paul-gauthier avatar paul-gauthier commented on May 29, 2024 3

Thanks for trying aider and filing this issue. No editing format is going to work 100% reliably with any of the GPT models. I have been using a pair of extensive benchmarking suites to try and make informed, quantitative decisions when implementing and improving the editing formats. But even so, sometimes the LLM will mess up.

So please keep me posted on problems you are seeing. They might hint at possible ways to improve the editing format.

from aider.

carlos-dc avatar carlos-dc commented on May 29, 2024 1

@carlos-dc what LLM are you using? Just curious

I am using gpt-4-1106-preview

from aider.

paul-gauthier avatar paul-gauthier commented on May 29, 2024 1

On benchmarks gpt-4-1106-preview seems to do better, but my sense is that gpt-4-0613 might actually be more capable at complex coding.

from aider.

batmanscode avatar batmanscode commented on May 29, 2024

@carlos-dc what LLM are you using? Just curious

from aider.

batmanscode avatar batmanscode commented on May 29, 2024

@carlos-dc thanks. Personally I've noticed a drop in quality with the turbo preview. In addition to an increased response time

from aider.

batmanscode avatar batmanscode commented on May 29, 2024

That's a great approach @paul-gauthier, appreciate the effort and that's probably the best one can do

Out of curiosity, how has the new turbo preview model been compared to gpt-4? In your benchmarks

Asking because I was using gpt-4-1106-preview when all of a sudden it got ~2x slower and following instructions poorly enough that I had to switch to the regular gpt-4

from aider.

batmanscode avatar batmanscode commented on May 29, 2024

On benchmarks gpt-4-1106-preview seems to do better, but my sense is that gpt-4-0613 might actually be more capable at complex coding.

Very interesting, thanks

from aider.

jimcraner avatar jimcraner commented on May 29, 2024

@paul-gauthier:

(First, aider is awesome -- kudos to you for developing it! And thanks for releasing it! :-)

Second: is there a good way for us to provide data for you about this reliability problem?

For 0.18, using 4-turbo and the SEARCH/REPLACE model, aider worked great (Django, standalone python, Laravel, HTML/CSS, HTMX, vanilla JS)

Since upgrading to 0.19, and now 0.20, using the diff edit model, I have had zero successful edits. I've tried on multiple Django projects and a Laravel project.

I'm going to downgrade to 0.18 for now, but if there is any sort of useful information or data that we can provide to you, please let me know. I'd love to help improve the app!

Thanks again!

from aider.

paul-gauthier avatar paul-gauthier commented on May 29, 2024

@jimcraner thanks for the info on the problems you are having.

You can try the latest version of aider v0.21.0 which has some improvements to the unified diff editing format. Alternatively, you can always run aider with --model gpt-4-1106-preview --edit-format diff to use the old SEARCH/REPLACE edit format with gpt 4 turbo.

I would love any concrete examples you have of editing failures. To be most useful, I need:

  • Aider version, model, and edit-format settings. Ideally you can just copy all the "announcement" lines when you run aider which report all of this and other helpful info.
  • A copy of the diffs which failed to apply. You can find these in .aider.chat.history.md.
  • A copy of the source file that was being updated, or at least the chunk that is mentioned in the failing diffs.

from aider.

paul-gauthier avatar paul-gauthier commented on May 29, 2024

I'm going to close this issue for now, but feel free to add a comment here and I will re-open or file a new issue any time.

from aider.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.