yesminebellalah Goto Github PK
Name: Yesmine Bellalah
Type: User
Bio: I'm a data scientist engineer and Big Data Master student at Paris 8.
Location: Paris
Blog: https://www.linkedin.com/in/yesmine-bellalah-72a572101/
Name: Yesmine Bellalah
Type: User
Bio: I'm a data scientist engineer and Big Data Master student at Paris 8.
Location: Paris
Blog: https://www.linkedin.com/in/yesmine-bellalah-72a572101/
We are going to study bicycle movements in Dublin city. We will retrieve the station data using the JCDeacaux API, store them in a document oriented, NO SQL database: MongoDB. Then study the static and dynamic data in this database, and we will predict for each station, the number of bicycles available per station, with LSTM neural network, to display at the end to the user, the stations that are close to him with the approximate number of bicycles available in each station.
Modelized and predicted the income level for a person per year , i.e which people save more or less than $50,000 / year. I started with a basic algorithm: logistic regression, then added the SMOTE technique to deal with the high imbalance in the data. I tried other algorithms based on decision trees ( such as Ada boost, Random Forest, Gradient boosting) , and Naive Bayes classifier. To assess and compare the performance of the classifiers, I used precision, recall, ROC auc, accuracy and confusion matrix .
- Collected the data (credits of 2016 ) , Identified low quality data , Imputed missing values ,Performed descreptive statistics and Visualized the data using ggplot2, rbokeh, plotly packages in R.
Recent Methods from Statistics and Machine Learning for Credit Scoring When a bank receives a loan application, based on the applicant’s profile the bank has to make a decision regarding whether to go ahead with the loan approval or not. To minimize loss from the bank’s perspective, the bank needs a decision system regarding who to give approval of the loan and who not to. 1) Develop classification model using Logistic Regression (LR) 2)- Assess the performance of the model using the confusion matrix and the ROC curve
In this project, we estimated a stochastic-process-based ( verifiying a Stochastic differential equation) expected value of a given function using numerical methods namely Monte Carlo method, and we tried to reduce the variance of the estimator using the antithetic method, control variates method and stratification method.
Multi-class news classification based on description into 42 categories
Performed a principal component analysis on a data set that treats patients who have had a stroke in order to follow their condition following the accident and compare assessment tools to evaluate their recovery.
Implemented Radial Basis Function Network (clusteringbased approach) with Python(Anaconda) and using it to perform both classification to resolve a credit scoring issue (on an imbalanced data set) , and regression for one-month-ahead and 4-month-ahead forecasting values relevant to Sales/Demands quantity of banking products.
Performed descriptive statistical study and classification of products according to their quality parameter using hierarchical clustering and preference mapping using R software
French stopwords collection
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.