Comments (4)
To address the discrepancies between the commits table and GitLab data, especially after merge operations, it's crucial to dive deeper into the specifics of your situation. Here's how we can proceed:
- Share the Python Script: The script you used for verification could provide insights into how data is being fetched and compared. This will help in understanding the methodology behind your findings.
- Identify Patterns: Knowing if these discrepancies follow any specific patterns, such as occurring more with larger commits or certain file types, can help in pinpointing the cause.
- Provide Examples: Examples of commit hashes with their discrepancies in additions count between the commits table and GitLab would be invaluable for a detailed investigation.
- Scope of the Issue: Clarifying whether this issue is isolated to specific repositories or is more widespread will aid in diagnosing the problem more effectively.
This information will be instrumental in uncovering the root cause of the discrepancies you're observing.
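As a rough illustration of the kind of examples requested above, the following minimal sketch lists commits whose additions count differs between two sources. It assumes you have already loaded both sets of counts into dictionaries keyed by commit hash; the function name `find_addition_mismatches` and the sample values are hypothetical and are not taken from the reporter's script or from DevLake.

```python
from typing import Dict, List, Tuple


def find_addition_mismatches(
    commits_table: Dict[str, int], gitlab: Dict[str, int]
) -> List[Tuple[str, int, int]]:
    """Return (sha, commits_table_additions, gitlab_additions) where the counts differ.

    Only commits present in both sources are compared.
    """
    mismatches = []
    for sha in sorted(commits_table.keys() & gitlab.keys()):
        if commits_table[sha] != gitlab[sha]:
            mismatches.append((sha, commits_table[sha], gitlab[sha]))
    return mismatches


if __name__ == "__main__":
    # Placeholder values for illustration only, not real measurements.
    commits_table_counts = {"a1b2c3": 120, "d4e5f6": 45}
    gitlab_counts = {"a1b2c3": 118, "d4e5f6": 45}
    for sha, ours, theirs in find_addition_mismatches(commits_table_counts, gitlab_counts):
        print(f"{sha}: commits table={ours}, GitLab={theirs}, delta={ours - theirs}")
```

A list like this, attached to the issue, makes it easier to see whether the deltas cluster around particular commits or file types.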
from incubator-devlake.
Thanks for reporting.
from incubator-devlake.
It appears the libgit2 library we're using has a different diff algorithm compared to GitLab. Unfortunately, this is likely expected behavior rather than a bug, and it wouldn't be easily configurable.
from incubator-devlake.
Hi, we checked the logic. The additions reported by the GitLab APIs differ from those in DevLake because:
- DevLake doesn't use the GitLab APIs to collect commits; it uses the gitextractor plugin.
- The gitextractor plugin uses the libgit2 library to calculate each commit's additions and deletions.
- The logic for calculating additions in gogit and in the GitLab APIs is slightly different.
Thus, it's a problem that cannot be addressed in DevLake for now. We can't switch gitextractor to the GitLab, GitHub, Bitbucket, or Azure DevOps APIs to collect commits (Git) data, as it would increase the collection time by 10x.
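If you want a third reference point when comparing counts, one option is to total the output of git's own `--numstat` format for a commit. The sketch below is only an illustration under that assumption: it is not how gitextractor or GitLab computes additions, and the helper name `sum_numstat` is hypothetical. It parses text in the `added<TAB>deleted<TAB>path` layout that `git show --numstat` prints, skipping binary files, which git reports as `-`.

```python
from typing import Tuple


def sum_numstat(numstat_output: str) -> Tuple[int, int]:
    """Sum additions and deletions from `git show --numstat` style output.

    Each relevant line looks like "<added>\t<deleted>\t<path>".
    Binary files appear as "-\t-\t<path>" and are skipped.
    """
    additions = 0
    deletions = 0
    for line in numstat_output.splitlines():
        parts = line.split("\t")
        if len(parts) < 3:
            continue  # not a numstat line (blank line, commit header, etc.)
        added, deleted = parts[0], parts[1]
        if not (added.isdigit() and deleted.isdigit()):
            continue  # binary file or unexpected content
        additions += int(added)
        deletions += int(deleted)
    return additions, deletions


if __name__ == "__main__":
    # Hand-written sample in numstat layout, for illustration only.
    sample = "10\t2\tsrc/main.go\n-\t-\tlogo.png\n3\t0\tREADME.md\n"
    print(sum_numstat(sample))
```

Feeding it the output of `git show --numstat --format= <sha>` from a local clone gives a count you can set beside the commits table and GitLab values; it may agree with neither, since each tool can diff differently.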
from incubator-devlake.