chatbot-kafka-consumer

Introduction

Red Hat OpenShift Data Science, a managed cloud platform for data scientists and developers of intelligent applications, supports the full machine learning lifecycle by providing a robust, scalable platform and a flexible, interactive environment for teams to do their work. Starburst Galaxy, based on open source Trino (formerly PrestoSQL), is a managed service that provides a single point of access to your data without requiring you to move it to a central repository. Starburst Galaxy focuses on the first, and often most difficult, problem teams face when starting a new project: the acquisition and preparation of data.

Figure: Machine Learning Lifecycle

Figure: High-level view of Starburst's data consumption layer

Demonstration

This demonstration shows how quickly a data scientist can join streaming data from a customer support chatbot to both the customer and financial data domains held in relational databases. The process is straightforward thanks to Starburst's data consumption layer abstractions.

High Level Architecture

Figure: Data source view

Requirements

Please see chatbot-env-setup for instructions to create a comparable demo environment.

You will need:

Getting Started

You will need an environment set up before getting started (see the Requirements section).

Log into your Red Hat OpenShift Data Science environment and select an appropriate notebook image. Be sure to include the following environment variables (unique to your environment):

  • KAFKA_BOOTSTRAP_SERVER
  • KAFKA_SECURITY_PROTOCOL
  • KAFKA_SASL_MECHANISM
  • KAFKA_USERNAME
  • KAFKA_PASSWORD
  • TRINO_USERNAME
  • TRINO_HOSTNAME
  • TRINO_PORT
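As a minimal sketch of how a notebook might consume these variables, the helpers below read them from the environment and fail early if any are missing. The function names (`require`, `kafka_client_kwargs`, `trino_connect_kwargs`) are hypothetical and not part of this repository. The Kafka keys assume the kafka-python client, whose `KafkaProducer` and `KafkaConsumer` accept these keyword arguments, and the Trino keys assume the `trino.dbapi.connect` call from the Trino Python client. The notebooks themselves may use different libraries.

```python
import os
from typing import Iterable, Mapping

KAFKA_VARS = (
    "KAFKA_BOOTSTRAP_SERVER",
    "KAFKA_SECURITY_PROTOCOL",
    "KAFKA_SASL_MECHANISM",
    "KAFKA_USERNAME",
    "KAFKA_PASSWORD",
)
TRINO_VARS = ("TRINO_USERNAME", "TRINO_HOSTNAME", "TRINO_PORT")


def require(env: Mapping[str, str], names: Iterable[str]) -> dict:
    """Return the requested variables, raising if any is unset or empty."""
    names = tuple(names)
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise KeyError("Missing environment variables: " + ", ".join(missing))
    return {name: env[name] for name in names}


def kafka_client_kwargs(env: Mapping[str, str] = os.environ) -> dict:
    """Map the KAFKA_* variables to keyword arguments.

    Assumption: the key names follow kafka-python's client parameters.
    """
    values = require(env, KAFKA_VARS)
    return {
        "bootstrap_servers": values["KAFKA_BOOTSTRAP_SERVER"],
        "security_protocol": values["KAFKA_SECURITY_PROTOCOL"],
        "sasl_mechanism": values["KAFKA_SASL_MECHANISM"],
        "sasl_plain_username": values["KAFKA_USERNAME"],
        "sasl_plain_password": values["KAFKA_PASSWORD"],
    }


def trino_connect_kwargs(env: Mapping[str, str] = os.environ) -> dict:
    """Map the TRINO_* variables to keyword arguments.

    Assumption: the key names follow trino.dbapi.connect (host, port, user).
    """
    values = require(env, TRINO_VARS)
    return {
        "host": values["TRINO_HOSTNAME"],
        "port": int(values["TRINO_PORT"]),  # environment values are strings
        "user": values["TRINO_USERNAME"],
    }
```

Under those assumptions, a notebook cell could pass the results along with `KafkaConsumer(topic, **kafka_client_kwargs())` or `trino.dbapi.connect(**trino_connect_kwargs())`. Depending on your environment, additional settings (for example an HTTP scheme or a password for Trino) may be required.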

Here's a quick highlight of the notebooks:

  • 0_kafka_prepopulate.ipynb: connect to Kafka and prepopulate the queue with randomized customer numbers and support requests.
  • 1_kafka_producer.ipynb: a simple "chatbot" that illustrates sending single messages.
  • 2_kafka_consumer.ipynb: a simple consumer that listens for the messages sent in the previous notebook's hello-world exercise.
  • 3_trino_explore.ipynb: connect to Starburst, then pull and explore your data.
  • 4_build_features.ipynb: build features for use in future models.
  • 5_nltk.ipynb: exercise for the reader.
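To give a feel for the prepopulation step, here is a rough sketch of how randomized customer numbers and support requests could be generated and serialized before being handed to a Kafka producer. Everything named here is an assumption made for illustration: the field names `customer_id` and `request`, the sample request texts, the customer number range, and the use of JSON as the message encoding. The actual notebook may structure its messages differently.

```python
import json
import random
from typing import List, Optional

# Hypothetical sample texts; the real notebook may use different requests.
SAMPLE_REQUESTS = [
    "I cannot log in to my account.",
    "My last invoice looks wrong.",
    "How do I update my billing address?",
]


def make_message(rng: random.Random, max_customer_id: int = 1000) -> dict:
    """Build one message with a random customer number and support request."""
    return {
        "customer_id": rng.randint(1, max_customer_id),
        "request": rng.choice(SAMPLE_REQUESTS),
    }


def encode(message: dict) -> bytes:
    """Serialize a message to UTF-8 JSON bytes, a common Kafka value format."""
    return json.dumps(message).encode("utf-8")


def decode(payload: bytes) -> dict:
    """Inverse of encode, as a consumer might apply to each record value."""
    return json.loads(payload.decode("utf-8"))


def make_batch(count: int, seed: Optional[int] = None) -> List[bytes]:
    """Generate `count` encoded messages; a fixed seed makes runs repeatable."""
    rng = random.Random(seed)
    return [encode(make_message(rng)) for _ in range(count)]
```

Each element returned by `make_batch` is a byte string that could be passed as the value of a producer's send call, and `decode` shows the matching step on the consumer side.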

Contributors

keklundrh
