bushra-ansari Goto Github PK
Name: Bushra Ansari
Type: User
Bio: Data Analysis Enthusiast | DTU Alum
Location: New Delhi
Name: Bushra Ansari
Type: User
Bio: Data Analysis Enthusiast | DTU Alum
Location: New Delhi
Given the details of cell nuclei taken from breast mass, objective is to predict whether or not a patient has breast cancer using the Ensembling Techniques. This is a classification problem. The dataset consists of several predictor variables and one target variable, Diagnosis. The target variable has values 'Benign' and 'Malignant', where 'Benign' means that the cells are not harmful or there is no cancer and 'Malignant' means that the patient has cancer and the cells have a harmful effect.
American Sign Language (ASL) is a complete, natural language that has the same linguistic properties as spoken languages, with grammar that differs from English. ASL is expressed by movements of the hands and face. It is the primary language of many North Americans who are deaf and hard of hearing and is used by some hearing people as well. This Computer Vision project is done by OpenCV Python Package.
This project has been done by using OpenCV package in Python. The objective is to train the computer to read the license plates whenever the computer camera encounters a vehicle and store the pictures of only the license plate to keep a track of all the vehicles. It involves the concept of Image Recognition.
Decision Tree is a supervised learning algorithm. The given problem is a classification problem. The data set consists of various predictors and a target variable - Outcome. Objective is to predict whether a person is diabetic or non-diabetic.
Given the e-commerce data, k-means clustering algorithm is used to cluster customers with similar interest. The data was collected from a well known e-commerce website over a period of time based on the customer’s search profile.
One of the most important tasks for any retail store company is to analyze the performance of its stores. The main challenge faced by any retail store is predicting in advance the sales and inventory required at each store to avoid overstocking and under-stocking. This helps the business to provide the best customer experience and avoid getting into losses, thus ensuring the store is sustainable for operation.
This project is based a classification algorithm i.e. KNN which is run on a mobile dataset consisting of 2000 rows and 15 columns. It is a multi-class problem where mobiles phones are classified in accordance with their price range.
This project is based on a classification algorithm i.e. Naive Bayes which is run on a mobile dataset consisting of 2000 rows and 15 columns. It is a multi-class problem where mobile phones are classified in accordance with their price range. There are four classes of price ranging from 0 to 3, 0 indicating cheaper mobiles phones and 3 representing expensive mobile phones. Univariate analysis is conducted to understand individual predictors and bivariate analysis is conducted to infer relationship between predictors with other predictors and target variable. Important features are identified by Random Forest.
Support Vector Machine Classification model is applied on bank dataset containing 41188 rows and 21 columns. The data is related with direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was required, in order to assess if the product (bank term deposit) would be ('yes') or not ('no') subscribed.
This project is based on House Prices dataset which consists of 2073 rows and 81 columns. Objective is to predict the price of the houses by a Regression Technique called Linear Regression after performing Principal Component Analysis.
This project is based on a supervised learning algorithm called Random Forest Regressor. The dataset consists of approximately one lakh rows and seven columns. Target variable is fare_amount.
Given the google rating data, hierarchical clustering algorithm is used to cluster reviews. This data set is populated by capturing user ratings from Google reviews. Reviews on attractions from 24 categories across Europe are considered. Google user rating ranges from 1 to 5 and the average user rating per category is calculated.
Logistic Regression is a supervised learning algorithm. This project is based on a classification problem where the objective is to predict how likely it is that the people will take an H1N1 flu vaccine. The dataset consists of 34 predictors including target variable i.e. h1n1 vaccine.
In this project, 500 tweets from various news channel have been extracted from twitter and their sentiment analysis is done with the help of polarity score. Also, 2 sports news have been extracted from Time of India website along with captions, headlines and photos.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.