Coder Social home page Coder Social logo

teradata_challenge's People

Contributors

chrizzo1 avatar leaflettuce avatar mitchb63 avatar

Watchers

 avatar  avatar  avatar  avatar

teradata_challenge's Issues

List of visuals needed

  • charity nav ranking?
  • description of demo
  • state distr
  • edu / disab (inequalities)
  • states deep dive
  • branch and rank trends
  • likelihood and time by volunteer
  • volunteer (who)
  • visualize log reg

polish presentation

From rough slides Chris maps together lets review and finalize then gtfo ๐Ÿ˜

Decide how to count 'time to hire'

Split out from #3, we need to figure out how to track time to hire to answer business problem revolving around demo differences and hiring time #2.

As @mitchb63 mentioned, there are time_to_(color) variables in df_contact we can use for this.

Should time to blue be the dependent for this question? Or should we create one based on date user account was created to date turned blue? OR some other method?

Future Issues

For updated issues and tasks for this project, see Kanban taskboard.
Will not be update everything here. (Only code related tasks).

Reduce datasets

Once we get a grasp of the var's we need from #7, we could drop any unneeded columns from the tables to make working with them easier (and quicker) in Tableau/PBI/Py?etc...

Define volunteer effectiveness

I think we have a good start to the client demo and time in system business questions in #2 .

Does anyone see a way to identify which clients had volunteer assistance? Or any other way we can go about figuring volunteer 'effectiveness'? I assumed it would be a var in contact but I can't seem to find it. Any thoughts?

Create network diagram

Use summary skills 'qualifications' variable to map out a network diagram.

steps

  • filter out to only skills and hired columns
  • split by hired/not hired (maybe?)
  • create co-occurance matrix
  • use networkx to map
  • aesthetics
  • EZ, right andy ๐Ÿ˜ฌ

Text cleaning

Once data types are set from #1, does any text within the data need cleaned?
This should probably only be done on data/variables that chosen business questions will require to save time.

For example:

  • input errors such as 'lieutenant' v 'leutenent' (misspellings)
  • remove unneeded sections, '01 campaign advisor' (remove 01)
  • splitting values, 'LA - March' into 'LA' and 'March'

Word cloud

Create word cloud of summary skills - hired vs. not hired.

Find Variables of Interest

It could be beneficial to get a list of the variables that will be used to solve the business question listed at #2

Anyone have thoughts on best route for this? We could keep a simple txt doc in the main dir listing them out, or add them to the EDA summary excel sheet maybe?

Once we have this it might be worth creating a custom dataset or two that drops all columns aside from these of interest to keep analysis less complicated.

Join datasets

Once questions are decided #2 and cleaning done #3 we should join the datasets as needed for analysis.

Finalize business solutions

Narrow focus of how to solve business questions. Though sharing EDA and discussion best route for presentation. To occur 3/30.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.