Topic: parquet-files Goto Github
Some thing interesting about parquet-files
Some thing interesting about parquet-files
parquet-files,Managing large data sets projects (Data Science)
User: 13caroline
parquet-files,A converter for the OSM PBFs to Parquet files
User: adrianulbona
Home Page: http://adrianulbona.github.io/2016/12/18/osm-parquetizer.html
parquet-files,Apache Spark application to get the top ten frequent routes and profitable areas
User: adrigrillo
parquet-files,A fast and simple command-line (CLI) tool to convert a Parquet file to an Apache Arrow file
User: alexkreidler
parquet-files,Data Engineering project on how to build Data Lake on S3 using Chicago Taxi Dataset
User: bdnf
parquet-files,A simple library and console application to illustrate how to read and load data into class models from Parquet files saved to Azure Blob Storage using Parquet .Net (parquet-dotnet). This is useful for E-L-T processes whereby you need to load the data into Memory, Sql Server (e.g. Azure SQL), etc. or any other location where there is no built-in or default mechanism for working with Parquet data.
User: cajuncoding
parquet-files,Threat Detection and Visualization
Organization: datatech-solutions
parquet-files,A command line tool for inspecting parquet files with PyArrow.
User: domvwt
parquet-files,ETL job with AWS Glue
User: dorianteffo
parquet-files,
User: ensleyec
parquet-files,Upstream classifier image preprocessing
User: etiennelardeur
parquet-files,:bangbang: Handle Big Data for Machine Learning using Python and PySpark, Building ETL Pipelines with PySpark, MongoDB, and Bokeh
User: foroozani
Home Page: https://github.com/apache/spark
parquet-files,Udacity Data Engeneering Nanodegree Program - My Submission of Project: Data Lake
User: futuretroglodyte
parquet-files,Scala code to read Parquet files as streams in Spark Streaming using Avro.
User: gpapag
parquet-files,Library to read a subset of Parquet files
User: hannes
parquet-files,:guardsman: Tools to Transform and Query Data with 'Apache' 'Drill'
User: hrbrmstr
Home Page: https://hrbrmstr.github.io/sergeant/
parquet-files,:guardsman: ☕️ Tools to Transform and Query Data with 'Apache' 'Drill'
User: hrbrmstr
parquet-files,A light-weight command-line tool to browse and query CSV, Excel and Apache Parquet files, regardless of their size.
User: ignaciomb
Home Page: https://pypi.org/project/csvcli/
parquet-files,OSM planet dump high performance data loader. Transform OpenStreetMap World/Region PBF dump into partitioned by H3 regions PostGIS pgsnapshot (lossless) OSM schema representation and/or into ArrowIPC/Parquet dumps
User: igor-suhorukov
Home Page: https://habr.com/en/post/717408
parquet-files,Proyecto Individual de MLOps sobre deployar una API de videojuegos de la plataforma Steam.
User: ingcarlapezzone
Home Page: https://pi1-games.onrender.com/
parquet-files,Glue Data Quality Example - Deploy to your AWS Account w/ Terraform to test
User: jaredfiacco2
parquet-files,Processes S3 Inventory Manifests and generates a report about the folder size and object size average
User: johnbrandborg
parquet-files,Merge Parquet Files on S3 with this AWS Lambda Function
User: m-kwiedor
parquet-files,Processing and exporting data from EPW files into other formats.
User: matbbastos
parquet-files,Daily scraps the data from rpi-imager-stats
User: matt40k
parquet-files,Project on MapReduce for the Μ111 - Big Data Management course, NKUA, Spring 2023.
User: mdarm
parquet-files,ETL pipeline that transforms JSON files from AWS S3 bucket to Parquet files also in S3 bucket
User: milamarcan
parquet-files,Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
User: mjakubowski84
Home Page: https://mjakubowski84.github.io/parquet4s/
parquet-files,MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.
Organization: mongodb-labs
Home Page: https://mongo-arrow.readthedocs.io
parquet-files,A docker image to read parquet files with drill in DataGrip
User: mschermann
parquet-files,Node-Red contrib that converts between a PARQUET string and its JavaScript object representation, in either direction.
User: msigrupo
parquet-files,Simple utility package to convert EDF/EDF+ files into Apache Parquet format.
User: narayanschuetz
parquet-files,UniParc dataset describing ~300 million protein sequences converted into relational tables accessible through Google BigQuery (and as Parquet files).
User: ostrokach
Home Page: https://gitlab.com/ostrokach/uniparc_xml_parser
parquet-files,Simple and small CLI to work with parquet files
User: otaviohenrique
parquet-files,Query and transform data with PRQL
Organization: prql
parquet-files,Converts between file formats such as CSV and Parquet
User: renesugar
parquet-files,Load data from the Million Song Dataset into a final dimensional model stored in S3.
User: rigganni
parquet-files,A Quarto notebook requesting a parquet file stored in S3
User: rlesur
Home Page: https://rlesur.github.io/quarto-ojs-parquet-s3/
parquet-files,Streaming kafka events using Spark in avro format and saving the events in parquet format
User: rupeshtiwari
parquet-files,A summative coursework for CSC8101 Engineering for AI
User: srking501
parquet-files,A lightweight Java library that facilitates reading and writing Apache Parquet files without Hadoop dependencies
Organization: strategicblue
parquet-files,Explore factors associated with Malware Infection using Spark SQL
User: sudip-padhye
parquet-files,Streaming data of Tiki with Kafka and processing with Spark, visualize with Elasticsearch & Kibana.
User: trannguyenhan
parquet-files,Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Organization: uber
parquet-files,Price Crawler - Tracking Price Inflation
User: uhussain
parquet-files,A web application for viewing Apache Parquet files . This is a Python + Flask application
User: vipinc007
parquet-files,Help you to visualize hadoop file formats.
User: yaphet17
parquet-files,create files which formats are like "orc", "parquet", "xlsx", "json" and so on with Python
User: yo-mah-ya
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.