Comments (13)
CC: @tdcmeehan @aditi-pandit @majetideepak
from presto.
CC: @feilong-liu @mlyublena @kaikalur
from presto.
it seems like the 6th parameter is marked false when it should be true. is that the only problem? or am I misunderstanding the issue?
from presto.
it seems like the 6th parameter is marked false when it should be true. is that the only problem? or am I misunderstanding the issue?
This is the only issue.
from presto.
Should return same result across multiple runs.
Why it should return the same result? arbitrary
is supposed to be ok to return any values right?
from presto.
@feilong-liu The expectation is that arbitrary(x ORDER BY x)
would always return same result, no?
from presto.
@feilong-liu The expectation is that
arbitrary(x ORDER BY x)
would always return same result, no?
Not sure about this. I think when the user use arbitrary function, it already specifies the intention that an arbitrary value is good.
from presto.
@feilong-liu The expectation is that
arbitrary(x ORDER BY x)
would always return same result, no?
No arbitrary definitely does not return the same result. ORDER BY is done in a very hacky way because its expensive (prevents partial agg :() so we do some filtering in the plan to say certain operations are not affected by order by (in fact we use allowlist). So only things like array_agg etc. respect it.
from presto.
see:
That uses implementation property to do planning!
from presto.
Here is what Presto documentation says:
https://prestodb.io/docs/current/functions/aggregate.html
"Some aggregate functions such as array_agg() produce different results depending on the order of input values. This ordering can be specified by writing an ORDER BY Clause within the aggregate function:"
My reading of this statement suggests that any function whose results depend on order of inputs (arbitrary, min_by, max_by, etc.) can be forced to return same result if we add an ORDER BY.
Furthermore, I understand that evaluating aggregate functions on sorted inputs is expensive and it makes sense to skip ORDER BY if it is not needed. For example, the results of min/max don't depend on whether inputs are sorted, hence, it is safe to drop ORDER BY. However, the results of min_by, max_by, arbitrary do depend on the order of inputs, hence, it is not safe to drop ORDER BY.
from presto.
Can you help me to setup project locally
from presto.
Can you help me to setup project locally
@tdcmeehan @majetideepak Tim, Deepak, do we have instructions somewhere?
from presto.
@atikkk please read through CONTRIBUTING. If you have any questions please ask in #dev in our Slack channel (found from prestodb.io).
from presto.
Related Issues (20)
- Wrong results for count (and probably correlated subqueries) over empty grouping sets
- Wrong timezone conversions for "America/Scoresbysund" HOT 8
- presto memory usage mechanism problem, after the execution of the calculation task, the memory usage is not reduced HOT 1
- Not able to build presto project HOT 9
- [native] Possible SEGV in HttpClient::createTransaction HOT 10
- Add a UDF for splitting array into slices of given length
- Add fail-fast function validation support for Presto C++
- Allow C++-defined system session properties
- Differential privacy primitives use insecure noise generation HOT 2
- Missing pushdown
- Flaky SingleStore tests HOT 1
- CI job SingleStore tests failing HOT 1
- Add ARM64 Support for Building Prestissimo Docker Image on Mac M1 HOT 8
- Backport https://github.com/prestodb/presto/pull/22926 into 0.285, 0.286 and 0.287 HOT 4
- Pushdown (partial) rowNumber under join
- Flaky test: TestMemoryManager.testReservedPoolDisabledMultiCoordinator
- Add documentation for Geospatial types in main types page HOT 5
- For each agg function with input param as <T>, Add an equivalent agg function with input param as array<T> HOT 1
- [docs] Combine the descriptions of session property with configuration property for history based optimization
- singlestore-dockerized-tests job is failing often HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from presto.