Swaviman Kumar's Projects
Authourship attribution with naive bayes using word and ngram
EDA on breast cancer dataset
A repository to learn basic data processing techniques (Wikipedia processing, feature selection), and use them for some basic Web query classification.
Employing statistical techniques, conduct a preliminary prognosis of Hypertension/hypotension, based on the level of hemoglobin and genetic history of the individual.
This simple project is focused on Computational Gastronomy & combines elements of data analysis, natural language processing, and information extraction. The assignment covers several tasks that involve working with recipe data and analyzing it using Python.
This project aims to conduct a random survey design for collecting responses regarding wine preferences of Italian consumers. Furthermore, it attempts to understand how preference share gets affected as we vary different attributes associated with wine with the use of a research method called Conjoint Analysis..
Our study focused on using the Big Five personality inventory to predict traits from students' smartphone sensor data collected over 2 months under the Horizon Europe project. Through correlation analyses and machine learning with cross-validation, we showed that predictions are reliable and accurate enough for practical use.
Perform Extensive Exploratory Data Analysis, apply three clustering algorithms & apply 3 classification algorithms on the given stroke prediction dataset and mention the best findings.
This is a little description about my new hello world prog. Thank you for reading this.
Apply machine learning to find top 10 similar images from a gallery folder given a query image.
My Github Readme page
This is a tutorial that covers the basics of NLP. We will cover few rudimentary operations such as tokenization, stemming etc. Happy learning.
An experiemental project to utilize LangChain and extract information from PDFs, utilizing OpenAI Text Embeddings.
End to end Business Intelligence Solution analyzing product, sales, finance and customer data. Showases use of advanced MS Power BI concepts, nuances of pivot, hierachy, snowflake schema, DAX and many more.
This is a tutorial that covers the basics of PySpark. We will cover few rudimentary dataframe operations such as withColumn function, when, otherwise etc. Happy learning.
A therapy recommender system to suggest best suitable treatments for patients based on their past medical records and other patients treatment record.
Developed an SQL Server data warehouse with a 'Production' schema, enabling PowerBI reporting and sales KPI analysis.
The purpose was to study the mood of respondents, what are the predictors of mood among students with different personality types and how do these predictors vary between different time diaries.
The idea behind SRC is that, instead of using a single feature vector to represent an input, multiple sparse representations are used, each one capturing a different aspect of the input.
Predicting Alzheimer from people's writing. Technique: SVMs.
Univariate and Multivariate Analysis performed on the Titanic Dataset