Hello team, I just recently updated to Aider v0.19.1 and I see that

<a class="user-mention notranslate" data-hovercard-type="user" data-hover

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

That's a great approach <a class="user-mention notranslate" data-hovercard-type="user"

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

New diff approach to making changes has not been very reliable. about aider HOT 10 CLOSED

carlos-dc commented on May 29, 2024 1

New diff approach to making changes has not been very reliable.

from aider.

Comments (10)

paul-gauthier commented on May 29, 2024 3

Thanks for trying aider and filing this issue. No editing format is going to work 100% reliably with any of the GPT models. I have been using a pair of extensive benchmarking suites to try and make informed, quantitative decisions when implementing and improving the editing formats. But even so, sometimes the LLM will mess up.

So please keep me posted on problems you are seeing. They might hint at possible ways to improve the editing format.

from aider.

carlos-dc commented on May 29, 2024 1

@carlos-dc what LLM are you using? Just curious

I am using gpt-4-1106-preview

from aider.

paul-gauthier commented on May 29, 2024 1

On benchmarks gpt-4-1106-preview seems to do better, but my sense is that gpt-4-0613 might actually be more capable at complex coding.

from aider.

batmanscode commented on May 29, 2024

@carlos-dc what LLM are you using? Just curious

from aider.

batmanscode commented on May 29, 2024

@carlos-dc thanks. Personally I've noticed a drop in quality with the turbo preview. In addition to an increased response time

from aider.

batmanscode commented on May 29, 2024

That's a great approach @paul-gauthier, appreciate the effort and that's probably the best one can do

Out of curiosity, how has the new turbo preview model been compared to gpt-4? In your benchmarks

Asking because I was using gpt-4-1106-preview when all of a sudden it got ~2x slower and following instructions poorly enough that I had to switch to the regular gpt-4

from aider.

batmanscode commented on May 29, 2024

On benchmarks gpt-4-1106-preview seems to do better, but my sense is that gpt-4-0613 might actually be more capable at complex coding.

Very interesting, thanks

from aider.

jimcraner commented on May 29, 2024

@paul-gauthier:

(First, aider is awesome -- kudos to you for developing it! And thanks for releasing it! :-)

Second: is there a good way for us to provide data for you about this reliability problem?

For 0.18, using 4-turbo and the SEARCH/REPLACE model, aider worked great (Django, standalone python, Laravel, HTML/CSS, HTMX, vanilla JS)

Since upgrading to 0.19, and now 0.20, using the diff edit model, I have had zero successful edits. I've tried on multiple Django projects and a Laravel project.

I'm going to downgrade to 0.18 for now, but if there is any sort of useful information or data that we can provide to you, please let me know. I'd love to help improve the app!

Thanks again!

from aider.

paul-gauthier commented on May 29, 2024

@jimcraner thanks for the info on the problems you are having.

You can try the latest version of aider v0.21.0 which has some improvements to the unified diff editing format. Alternatively, you can always run aider with --model gpt-4-1106-preview --edit-format diff to use the old SEARCH/REPLACE edit format with gpt 4 turbo.

I would love any concrete examples you have of editing failures. To be most useful, I need:

Aider version, model, and edit-format settings. Ideally you can just copy all the "announcement" lines when you run aider which report all of this and other helpful info.
A copy of the diffs which failed to apply. You can find these in .aider.chat.history.md.
A copy of the source file that was being updated, or at least the chunk that is mentioned in the failing diffs.

from aider.

paul-gauthier commented on May 29, 2024

I'm going to close this issue for now, but feel free to add a comment here and I will re-open or file a new issue any time.

from aider.

New diff approach to making changes has not been very reliable. about aider HOT 10 CLOSED

Comments (10)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent