creditcard-egd's Introduction

💳 Credit Card Fraud Detection with PySpark and Dataproc

Welcome to the exciting world of credit card fraud detection! This project uses PySpark and Google's Dataproc to build a model that can accurately identify fraudulent transactions so they can be stopped before they cause harm.

📊 Dataset

Our dataset comes from Kaggle and consists of roughly 285,000 credit card transactions, of which only 492 are fraudulent. The data has been anonymized to protect privacy, but we can still use it to train our model and detect fraud. Because fraud makes up such a small share of the data, the classes are heavily imbalanced, which shapes how the results should be read.
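To make the imbalance concrete, here is a minimal sketch of the arithmetic. It assumes a rounded total of 285,000 transactions (an approximation, not an exact count) together with the 492 fraud cases quoted above. The function names are hypothetical, and the inverse-frequency weighting shown is one common way to handle imbalance, not necessarily the one used in this project.

```python
# Rough class-imbalance arithmetic for the dataset described above.
# TOTAL_TRANSACTIONS is a rounded assumption; FRAUD_TRANSACTIONS is from the README.
TOTAL_TRANSACTIONS = 285_000
FRAUD_TRANSACTIONS = 492


def fraud_rate(total: int, fraud: int) -> float:
    """Share of transactions that are fraudulent."""
    return fraud / total


def majority_baseline_accuracy(total: int, fraud: int) -> float:
    """Accuracy of a trivial model that labels every transaction as legitimate."""
    return (total - fraud) / total


def balanced_class_weights(total: int, fraud: int) -> dict:
    """Inverse-frequency weights: total / (number_of_classes * class_count).

    Label 0 is legitimate, label 1 is fraud (a labeling assumption).
    """
    legit = total - fraud
    return {0: total / (2 * legit), 1: total / (2 * fraud)}


if __name__ == "__main__":
    rate = fraud_rate(TOTAL_TRANSACTIONS, FRAUD_TRANSACTIONS)
    baseline = majority_baseline_accuracy(TOTAL_TRANSACTIONS, FRAUD_TRANSACTIONS)
    weights = balanced_class_weights(TOTAL_TRANSACTIONS, FRAUD_TRANSACTIONS)
    print(f"Fraud rate: {rate:.4%}")
    print(f"Always-legitimate baseline accuracy: {baseline:.4%}")
    print(f"Class weights: {weights}")
```

Under these assumptions, fewer than 2 in every 1,000 transactions are fraudulent, so a model that never flags anything would still score above 99.8% accuracy. That is why recall, precision, and F1 say more about a fraud model than accuracy alone.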

🚀 Getting Started

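To run this project, you'll need PySpark, Jupyter Notebook, and access to Google's Dataproc. Dataproc is a managed cloud service, so it requires a Google Cloud project rather than a local installation. If you don't already have these, don't worry! Getting everything set up is straightforward.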

๐Ÿ† Results

We're happy to report that our final model achieved an F1 score of 0.82, with an accuracy of 99.93%. In terms of fraud caught, the model detected 73% of all fraudulent transactions (its recall), with a false positive rate of only 0.01%.
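The README reports F1 and recall but not precision. Since F1 is the harmonic mean of precision and recall, the implied precision can be recovered from the other two. The sketch below does that arithmetic. The function names are hypothetical, and the inputs are the rounded figures quoted above, so the result is only approximate.

```python
# Recover the precision implied by a reported F1 score and recall.
# Derivation: F1 = 2PR / (P + R)  =>  P = F1 * R / (2R - F1)


def precision_from_f1_and_recall(f1: float, recall: float) -> float:
    """Solve the F1 definition for precision."""
    denominator = 2 * recall - f1
    if denominator <= 0:
        # No valid precision exists for this combination of F1 and recall.
        raise ValueError("F1 must be smaller than 2 * recall")
    return f1 * recall / denominator


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


reported_f1 = 0.82      # from the results above (rounded)
reported_recall = 0.73  # "detected 73% of all fraudulent transactions"

implied_precision = precision_from_f1_and_recall(reported_f1, reported_recall)

if __name__ == "__main__":
    print(f"Implied precision: {implied_precision:.3f}")
    # Sanity check: plugging the result back in reproduces the reported F1.
    print(f"Round-trip F1: {f1_score(implied_precision, reported_recall):.2f}")
```

With these rounded inputs the implied precision comes out at roughly 0.94. That would mean most flagged transactions are genuine fraud, while about a quarter of fraud cases still go undetected.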

🤖 Conclusion

In conclusion, this project showcases the power of PySpark, Google's Dataproc, and machine learning in the fight against credit card fraud. It was a challenging task, with an imbalanced dataset and a high-stakes goal, but we rose to the challenge and built a model that can make a real-world impact.

So, what are you waiting for? Join us on this exciting journey of fraud detection and prevention, powered by Google's Dataproc!

creditcard-egd's People

Contributors

sandokhan
