Comments (10)
Thanks for trying aider and filing this issue. No editing format is going to work 100% reliably with any of the GPT models. I have been using a pair of extensive benchmarking suites to try and make informed, quantitative decisions when implementing and improving the editing formats. But even so, sometimes the LLM will mess up.
So please keep me posted on problems you are seeing. They might hint at possible ways to improve the editing format.
from aider.
@carlos-dc what LLM are you using? Just curious
I am using gpt-4-1106-preview
from aider.
On benchmarks gpt-4-1106-preview
seems to do better, but my sense is that gpt-4-0613
might actually be more capable at complex coding.
from aider.
@carlos-dc what LLM are you using? Just curious
from aider.
@carlos-dc thanks. Personally I've noticed a drop in quality with the turbo preview. In addition to an increased response time
from aider.
That's a great approach @paul-gauthier, appreciate the effort and that's probably the best one can do
Out of curiosity, how has the new turbo preview model been compared to gpt-4? In your benchmarks
Asking because I was using gpt-4-1106-preview when all of a sudden it got ~2x slower and following instructions poorly enough that I had to switch to the regular gpt-4
from aider.
On benchmarks
gpt-4-1106-preview
seems to do better, but my sense is thatgpt-4-0613
might actually be more capable at complex coding.
Very interesting, thanks
from aider.
(First, aider is awesome -- kudos to you for developing it! And thanks for releasing it! :-)
Second: is there a good way for us to provide data for you about this reliability problem?
For 0.18, using 4-turbo and the SEARCH/REPLACE model, aider worked great (Django, standalone python, Laravel, HTML/CSS, HTMX, vanilla JS)
Since upgrading to 0.19, and now 0.20, using the diff edit model, I have had zero successful edits. I've tried on multiple Django projects and a Laravel project.
I'm going to downgrade to 0.18 for now, but if there is any sort of useful information or data that we can provide to you, please let me know. I'd love to help improve the app!
Thanks again!
from aider.
@jimcraner thanks for the info on the problems you are having.
You can try the latest version of aider v0.21.0 which has some improvements to the unified diff editing format. Alternatively, you can always run aider with --model gpt-4-1106-preview --edit-format diff
to use the old SEARCH/REPLACE edit format with gpt 4 turbo.
I would love any concrete examples you have of editing failures. To be most useful, I need:
- Aider version, model, and edit-format settings. Ideally you can just copy all the "announcement" lines when you run aider which report all of this and other helpful info.
- A copy of the diffs which failed to apply. You can find these in
.aider.chat.history.md
. - A copy of the source file that was being updated, or at least the chunk that is mentioned in the failing diffs.
from aider.
I'm going to close this issue for now, but feel free to add a comment here and I will re-open or file a new issue any time.
from aider.
Related Issues (20)
- Refactor legacy codebase HOT 3
- [Workflow] fzf file selection HOT 4
- FR: add docstrings, types and tests to udiff_coder.py HOT 2
- Preventing Diff in Commit Messages HOT 1
- Wrong project name for pipx in FAQ HOT 1
- Incorrect prompt to "Add these files to the chat?" HOT 2
- Command line paramater to specify test command HOT 6
- When git user and email are not set, don't dump a bunch of useless info to the screen HOT 5
- Error - Could not run aider on existing git repo HOT 7
- Support tabs in the prompt HOT 8
- Abstract LLM interfaces and Claude 3 Opus HOT 9
- Enhance error message for missing API Key HOT 3
- Aider very slow ( on large repos ) HOT 2
- Aider Researcher HOT 3
- Extending Aider - Senior Aider AI HOT 4
- Add files based on content HOT 2
- Add deepseek-coder api HOT 5
- Bug when starting aider HOT 9
- [BUG] aider can cause data loss HOT 8
- Missing tags.scm abort execution of repo map HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from aider.