Topic: dataproc Goto Github
Some thing interesting about dataproc
Some thing interesting about dataproc
dataproc,GCP_Data_Enginner
User: anjijava16
dataproc,Collected data about from three sources, one opinion-based social media in twitter, research data in New York Times, and the third is the common crawl data for the same topic or key phrase, and from similar time periods. Processed the three data sets collected individually using classical big data methods like Map Reduce in Google Dataproc Clusters. And then compared the outcomes using popular visualization methods in tableau.
User: bhagyashrit
Home Page: https://buffalo.box.com/s/osi9xe7dmmyw274gbhxp3z8daphpzxwg
dataproc,gke with terraform, dataproc with terraform
Organization: cloudgear-io
dataproc,Working examples for some components on GCP, and instructions on how to run them.
User: cuong3
dataproc,Performance Observability for Apache Spark
Organization: dataflint
dataproc,Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and pipelines.
Organization: debussy-labs
dataproc,A search engine to query social media insights with political theme
Organization: dsc-umass
dataproc,Dataproc Customisable HA cluster debian-9 with zookeeper,kafka ,BigQuery and other tools/jobs with Terraform
User: dwaiba
dataproc,Under construction....
User: enr1que319
dataproc,Demonstration of Google Cloud Dataproc for running Spark jobs with Java
User: garystafford
dataproc,Demonstration of Google Cloud Dataproc for running PySpark jobs
User: garystafford
dataproc,Demonstration of Google Cloud Dataproc Workflow Templates
User: garystafford
dataproc,This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.
Organization: googleapis
Home Page: https://cloud.google.com/dataproc/
dataproc,An end to end demo of Google's Cloud data and analytic stack.
Organization: googlecloudplatform
dataproc,Dataproc Scala Examples is an effort to assist in the creation of Spark jobs written in Scala to run on Dataproc.
Organization: googlecloudplatform
dataproc,Trino Autoscaler on Dataproc automates the scaling of Dataproc cluster based on real-time resource utilization by Trino workloads
Organization: googlecloudplatform
dataproc,Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service
Organization: googlecloudplatform
dataproc,Creating an Inverted Index of words occurring in a large set of documents extracted from web pages using Hadoop MapReduce and Google Dataproc
User: imehrdadmahdavi
dataproc,A searchable collection of useful little pieces of code
User: j-sephb-lt-n
dataproc,Data Workflows with GCP Dataproc, Apache Airflow and Apache Spark
User: jaiswalanshul
dataproc,La empresa GreenMiles NYC Taxis está interesada en invertir en el sector de transporte de pasajeros con automóviles, con una visión de un futuro menos contaminado y ajustarse a las tendencias de mercado actuales.
User: leocorbur
dataproc,Digital Innovation One - Desafio GCP Dataproc. O desafio consiste em efetuar um processamento de dados utilizando o produto Dataproc do GCP. Esse processamento irá efetuar a contahem das palavras de um livro e informar quantas vezes cada palavra aparece no mesmo.
User: lucianocoelho-28
dataproc,Companion to Learning Hadoop and Learning Spark courses on Linked In Learning
User: lynnlangit
Home Page: https://www.linkedin.com/learning/learning-hadoop-2
dataproc,KMU CS Hot Topics in Big Data
User: maengsanha
dataproc,Generando un proceso ETL con dataset de Amazon
User: magdielgutierrez
dataproc,Data Pipeline from the Global Historical Climatology Network DataSet
User: marcosmjd
dataproc,An educational project to build an end-to-end pipline for near real-time and batch processing of data further used for visualisation and a machine learning model.
User: marieeczy
dataproc,✈ A Spark-based ETL Pipeline for the OpenSky and OpenFlights Datasets
User: michailparaskevopoulos
dataproc,Collection of personal resources on Google Cloud
User: mr-ubik
dataproc,ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipeline ― Cloud Storage, Dataproc, PySpark, Cloud Spanner and Tableau
User: prakashdontaraju
dataproc,Creating a robust and scalable data pipeline on Google Cloud Platform (GCP) to monitor and analyze stock performance. Leveraging the power of GCP's data processing and storage services, a comprehensive solution has been built to efficiently collect, process, and visualize stock data.
User: quannguyen0103
dataproc,opens a chrome browser to a dataproc cluster
Organization: spotify
dataproc,Big data analysis of 'shared-world' cloud application.
User: teanlouise
Home Page: https://teanlouise.github.io/shared-world-data
dataproc,EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more.
User: tharwaninitin
dataproc,Inventory value is also important for determining a company's liquidity, or its ability to meet its short-term financial obligations. A high inventory value can indicate that a company has too much money tied up in inventory, which could make it difficult for the company to pay its bills.
User: thunchanokbow
dataproc,Repositório para armazenar artefatos de um trabalho da disciplina de Computação Distribuída.
User: tiagosanti
dataproc,Project for the Data Engineering Zoomcamp by DataTalks.Club
User: toludaree
dataproc,Using PySpark for Tensorflow model inferencing on GCP Dataproc Cluster. Demo for PyCon Hong Kong Fall 2020 Presentation
User: vionwinnie
dataproc,Ambiente de treinamento para Dataproc e DeltaLake
User: vvalcristina
dataproc,Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag
User: wittline
Home Page: https://wittline.github.io/pyDag/
dataproc,
User: yukia3e
dataproc,Orchestration Dataproc serverless job with Airflow
User: zaivi
dataproc,DataTalksClub Data Engineering Zoomcamp Project
User: zy969
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.