My name is ManojRam Mopati, a data-driven professional currently pursuing a Master of Science in Data Science at Coventry University, Big Data Analytics, Machine Learning and other Modules included.
I began my career at the service-based company Infosys Ltd in the comapny role of Software Quality Engineer, Where I initially worked as a client-facing role Regression Test Analyst for few days before transitioning into a client-facing role Data Analyst, where I worked with large-scale datasets to generate insights, automate reporting pipelines, and deliver high-impact dashboards.
My professional work includes building Power BI dashboards for 5 business units, performing SQL-based data extraction and transformation, conducting exploratory data analysis (EDA), and leveraging PySpark in Azure Databricks to streamline data processing workflows.
I have also developed end-to-end data science and machine learning projects in domains such as healthcare, finance, and real estate. These include predictive modeling, classification, recommendation systems, and deep learning applications—many of which are showcased in this portfolio.
I am passionate about data storytelling—translating complex datasets into insights that drive business decisions. I thrive in collaborative, agile environments and enjoy working closely with cross-functional teams.
I am currently seeking data science internships or part-time opportunities in the UK or Europe where I can contribute using my technical skills in SQL, Python, Power BI, and cloud analytics platforms like Azure.
Outside of work, I enjoy exploring new technologies, competing in data science challenges, and contributing to open-source projects. I’m committed to continuous learning and stay updated through online courses and workshops.
This portfolio reflects my passion and capability in analyzing data, building solutions, and communicating insights effectively. Each project is a step forward in my journey to becoming a well-rounded data professional.
Skills & Tools:
Python, SQL,data collection, data analysis, EDA, Statistical analysis, and visualization libraries, Power BI, Scikit-learn, TensorFlow, PySpark, Databricks, Microsoft Azure, Machine Learning, Deep Learning, NLP, Time series, Azure DevOps.
I also have experience with cloud platforms like GCP, Streamlit Community Cloud, AWS, and Azure for model deployment and managing data pipelines.
📊 Feel free to explore my github code repos and get in touch if you’re looking for a dedicated Data Science intern ready to make an impact!
As a Data Analyst, I played a key role in transforming raw insurance and policy data into actionable business insights by collaborating with Business Analysts, Scrum Leads, and product managers to meet evolving business requirements. I conducted in-depth exploratory data analysis (EDA), built SQL-based reports, and designed interactive dashboards using Power BI that empowered business teams to make informed, data-driven decisions. My work involved automating reporting pipelines using Azure Databricks and PySpark to enhance the efficiency of data processing tasks, especially for large-scale claim and policy datasets (10M+ rows). I also developed Python scripts for data transformation, anomaly detection, and analytical modeling to support strategic decisions. Additionally, I was responsible for creating and maintaining clear documentation of data processes, validation rules, and business logic to ensure transparency, reusability, and audit-readiness. I actively participated in Agile/Scrum ceremonies—including sprint planning, reviews, and retrospectives—which enabled collaborative progress tracking and iterative delivery of analytics solutions. These contributions collectively led to improved data visibility, a 50% reduction in manual reporting efforts, faster reporting cycles, and more consistent stakeholder engagement through high-quality, insight-driven deliverables.
Tools:Microsoft Power BI, Oracle SQL Developer, Pyspark, Python, Azure Databricks, Azure Devops, SQL, Excel, Agile.
Developed a production-ready machine learning web application that predicts customer churn for telecom companies with 99%+ accuracy. Built an end-to-end MLOps platform featuring real-time predictions, automated deployment, and comprehensive data analysis to help businesses proactively identify at-risk customers and implement retention strategies. The application combines advanced machine learning algorithms with modern web technologies and cloud infrastructure for scalable, reliable performance.
Tools: Python, FastAPI, Scikit-learn, XGBoost, LightGBM, Pandas, NumPy, Docker, Docker Hub, Azure Container App, CI/CD Pipelines, GitHub Actions, Postman, Jupyter Notebook, Matplotlib, Seaborn, Pytest, Jinja2, Pydantic, Git, HTML.
Developed a robust deep learning application to classify potato leaf diseases from image data, empowering farmers and agricultural professionals with early detection and effective disease management. This project leverages advanced computer vision and neural network techniques to streamline the identification of common potato diseases (such as early blight and late blight) from uploaded images, providing fast and accurate results for practical field use.
Tools:Python, TensorFlow, Keras, OpenCV, Numpy, Pandas, Scipy, Matplotlib, Seaborn, Scikit-learn, Streamlit, FastAPI, Uvicorn, Jupyter Notebook, Request, Threading, Visualization, Deep Learning, CNN, Streamlit Community Cloud.
Developed a machine learning application that predicts loan approval outcomes based on applicant data, assisting financial institutions with risk assessment and streamlined decision-making. Leveraging advanced data processing techniques and predictive modeling, this project automates the evaluation of loan applications using key financial and demographic features, providing rapid and reliable approval predictions for practical business use.
Tools:Jupyter Notebook, Python, Scikit-Learn, NumPy, Pandas, Matplotlib/Seaborn, Data Visualization, Machine Learning, Streamlit Community Cloud.
Developed a machine learning project designed to estimate gemstone prices based on their characteristics such as carat, depth, table, dimensions, cut, color, and clarity.The project features a complete data pipeline, from data ingestion and preprocessing to advanced model training and deployment.
Tools:Data Ingestion, EDA, Visualization, Probability&Statistics, Feature Engineering, Feature Scaling, Feature Selection, Data Preprocessing, Python Libraries, Machine Learning, AWS Cloud Deployement.
Developed a machine learning web application designed to estimate house prices in the Boston area based on multiple property features. Leveraging regression techniques and an ensemble of advanced models (including CatBoost, XGBoost, and KNN), this project guides users through predicting home values by inputting relevant attributes such as crime rate, number of rooms, tax rate, and more.
Tools:Data Ingestion, EDA, Data Preprocessing, Visualization, Python Libraries, Machine Learning, Streamlit Community Cloud.
Developed a deep learning application that predicts breast cancer from medical data, assisting healthcare professionals with early diagnosis and effective patient management. Leveraging advanced machine learning techniques and neural networks, this project streamlines the process of identifying the likelihood of breast cancer based on clinical features, providing rapid and accurate predictions for practical healthcare use.
Tools:Jupyter Notebook, Python, TensorFlow/Keras (or PyTorch), Scikit-Learn, Data Visualization, NumPy, Matplotlib/Seaborn, Machine Learning, Deep Learning, Streamlit Community Cloud.
Developed a machine learning-based application to predict student academic performance, enabling educators and institutions to identify at-risk students early and implement targeted interventions. By leveraging advanced data analytics and predictive modeling, this project streamlines the evaluation of student outcomes based on a variety of academic and socio-demographic factors, facilitating data-driven decisions for improved educational management and student success.
Tools: Jupyter Notebook, Python, Scikit-Learn, NumPy, Pandas, Matplotlib/Seaborn, Machine Learning, Data Visualization, Streamlit Community Cloud.
Want to know more about my job and education? Download my resume to get the complete details.
Download ResumeFeel free to reach out to me through the following platforms: