I'm a machine learning engineer.
- ๐ฑ Iโm currently learning Rust, and using Go and Python in my daily work.
- ๐ซ How to reach me: twitter
- ๐ Pronouns: He/His
- โก Fun fact: I enjoy playing with guitar and composed some original songs myself.
Python binding of MurmurHash3 Rust implementation.
License: BSD 3-Clause "New" or "Revised" License
I'm a machine learning engineer.
The clippy test shows the errors below, which is not expected:
error: deref on an immutable reference
--> src/lib.rs:80:34
|
80 | fn hash128_x[64](https://github.com/tushushu/murmurust/runs/7928640580?check_suite_focus=true#step:6:65)(_py: Python, key: &str, seed: u32, signed: bool) -> Py<PyAny> {
| ^ help: if you would like to reborrow, try removing `&*`: `&`
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#borrow_deref_ref
This is going to be fixed by PyO3/pyo3#2555. Currently, I have to disable the clippy errors, and when PYO3 0.17.0 released, we need change it back.
User can decide whether use signed int(i32) or unsigned int (u32) as hash32
function return type. For example:
>>> import mmh3
>>> mmh3.hash('foo', signed=False)
4138058784
>>> mmh3.hash('foo', signed=True)
-156908512
>>> mmh3.hash('foo')
-156908512
mmh3
repo link is https://github.com/hajimes/mmh3mmr3/tests
path.Need to implement hash128
function by Rust, and provide a Python wrapper. For the details, please refer to how we implement hash32
in this repo.
Run the github workflow - benchmark, and paste the report in readme.md
.
The repo link of mmh3
is https://github.com/hajimes/mmh3, and it's a similar project to us, which is implemented by C++ and Python.
Rename mmh3_32
as hash32
to make it convenient to use.
Can refer to project such as hyperjson
.
Add badges as below:
Please refer to my another repo. https://github.com/Rust-Data-Science/ulist
Please refer to my another repo https://github.com/Rust-Data-Science/ulist/blob/main/.github/workflows/publish.yml
By using SIMD
, inline
, unsafe
or any technique to improve the benchmark of this lib.
Disable the array bound check by using unsafe method or iterators. Related to #6
The default seed of hash32 function is zero, for example:
>>> import mmr3
>>> mmr3.hash32('foo')
4138058784
>>> mmr3.hash32('foo', 0)
4138058784
By calling Numpy C-API, implement hash functions for Numpy array. For example:
import numpy as np
import mmr3
arr = np.array(['foo', 'bar', 'baz'], dtype=object)
mmr3.hash32(arr)
Here is the Rust bindings for Numpy C-API: https://github.com/PyO3/rust-numpy
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.