# Stock-Market-Prediction

This repository began as a **7th-semester minor project** and evolved into our **8th-semester major project**, **"Advanced Stock Price Forecasting Using a Hybrid Model of Numerical and Textual Analysis."** It uses **Python, NLP (NLTK, spaCy), ML models, Grafana, InfluxDB, and Streamlit** for data analysis and visualization.

## Project Description

The **Advanced Stock Price Forecasting Using a Hybrid Model of Numerical and Textual Analysis** project predicts stock prices by combining numerical data with textual analysis. The project components include:

1. **Data Collection and Storage**: We gathered historical stock data of major companies and stored it in an InfluxDB database to handle large-scale time-series data efficiently.
2. **Data Visualization**: A Grafana dashboard provides real-time visualization of stock prices and analysis results, making the data easier to interpret and act on.
3. **Textual Analysis for Enhanced Forecasting**: We used Natural Language Processing (NLP) libraries, such as NLTK and spaCy, to analyze financial news and reports. This component complements the numerical analysis to improve the accuracy of our hybrid forecasting model.
4. **Machine Learning Models**: The project used Naive Bayes, MLP (Multi-Layer Perceptron), Logistic Regression, and Random Forest models to process both numerical and textual data.
5. **Reddit Chatbot Data Visualization Integration**: We added static and interactive plots of chatbot data from Reddit, using Matplotlib and Seaborn to visualize user interactions, message frequency, and topic distribution.
6. **Collaboration and Project Management**: The repository includes contributions from all team members, with tasks organized for smooth collaboration and version control.

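To make the textual side of the model list above more concrete, here is a minimal sketch of a multinomial Naive Bayes classifier written with the Python standard library only. It is not the project's actual implementation: the whitespace tokenizer stands in for NLTK/spaCy tokenization, the `up`/`down` labels are an assumed target, and the training headlines are made-up toy data.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for NLTK/spaCy tokenization)."""
    return text.lower().split()


def train_naive_bayes(samples):
    """Count labels and per-label word frequencies from (headline, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for text, label in samples:
        label_counts[label] += 1
        for token in tokenize(text):
            word_counts[label][token] += 1
            vocabulary.add(token)
    return label_counts, word_counts, vocabulary


def predict(model, text):
    """Return the label with the highest log-posterior, using Laplace smoothing."""
    label_counts, word_counts, vocabulary = model
    total_samples = sum(label_counts.values())
    scores = {}
    for label, count in label_counts.items():
        score = math.log(count / total_samples)  # log prior
        denominator = sum(word_counts[label].values()) + len(vocabulary)
        for token in tokenize(text):
            if token in vocabulary:  # ignore words never seen in training
                score += math.log((word_counts[label][token] + 1) / denominator)
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical toy headlines, invented purely for illustration.
TRAINING = [
    ("profits surge after strong quarter", "up"),
    ("shares rally on record revenue", "up"),
    ("company beats earnings expectations", "up"),
    ("stock plunges after weak guidance", "down"),
    ("shares fall on regulatory probe", "down"),
    ("company misses earnings expectations", "down"),
]

model = train_naive_bayes(TRAINING)
prediction = predict(model, "shares rally after strong earnings")
print(prediction)
```

In a hybrid setup, a text-derived signal like this would be one input alongside numerical price features. How the two are combined in this project is described in the thesis, not in this sketch.
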
## Directory Structure

```markdown
Stock-Market-Prediction/
├── Codes/                              # Core code modules for analysis and app development
│   ├── Historical_Data_Analysis/       # Scripts for long-term market trend analysis
│   ├── Partial_Data_Analysis/          # Focused short-term or segmented data analysis
│   ├── Ticker_Symbols_Stocks/          # Scripts for retrieving and managing ticker symbols
│   └── Flask_App/                      # Web interface built using Flask framework
│
├── Conferences/                        # Presentation and academic conference materials
│
├── Documents/                          # Miscellaneous reports and supporting documents
│   ├── Major_Project/                  # Final-year major project resources
│   │   ├── PPT/                        # Presentation slides for the major project
│   │   ├── Proforma_&_Progress_Report/ # Official progress reports and planning forms
│   │   └── Thesis/                     # Final thesis document with research and results
│   │
│   └── Minor_Project/                  # Minor-project materials
│       ├── PPT/                        # Slides prepared for minor project presentation
│       ├── Proforma_&_Progress_Report/ # Progress reports and planning forms for minor project
│       └── Thesis/                     # Final minor project report or thesis
│
├── Reference_Documents/                # Research papers and helpful external references
│
├── Resources/                          # Datasets, libraries, and supporting tools
│
├── LICENSE                             # Terms and conditions for usage and distribution
└── README.md                           # Overview, setup guide, and project introduction
```

### Thesis Reports

> Major & Minor Project Reports
> Detailed thesis reports for both major and minor projects are available under their respective [`Thesis`](Documents/Thesis/) folders in [`Documents/`](Documents/).

### InfluxDB Setup Guide

> Time-Series Data Storage & Integration
> A step-by-step InfluxDB setup and data integration guide is available at [`Codes/Historical_Data_Analysis/InfluxDB/`](Codes/Historical_Data_Analysis/InfluxDB/).

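As a rough illustration of what storing stock data in InfluxDB involves, the sketch below formats one daily price record as an InfluxDB line protocol string (`measurement,tag=value field=value timestamp`). The measurement name `stock_price`, the `ticker` tag, the field names, and the sample values are all assumptions made for this example; the guide linked above describes the schema the project actually uses.

```python
from datetime import date, datetime, timezone


def escape_tag(value):
    """Escape commas, equals signs, and spaces, which line protocol requires in tag values."""
    return value.replace(",", r"\,").replace("=", r"\=").replace(" ", r"\ ")


def to_line_protocol(measurement, ticker, day, open_, high, low, close, volume):
    """Format one daily OHLCV record as a line protocol string.

    Prices are written as floats, volume as an integer (the trailing "i"),
    and the timestamp is midnight UTC of `day` in nanoseconds.
    """
    midnight_utc = datetime(day.year, day.month, day.day, tzinfo=timezone.utc)
    timestamp_ns = int(midnight_utc.timestamp()) * 1_000_000_000
    fields = (
        f"open={float(open_)},high={float(high)},"
        f"low={float(low)},close={float(close)},volume={int(volume)}i"
    )
    return f"{measurement},ticker={escape_tag(ticker)} {fields} {timestamp_ns}"


# Hypothetical record: the ticker and prices are invented for illustration.
line = to_line_protocol(
    "stock_price", "ACME", date(2024, 1, 2), 100.0, 105.5, 99.5, 104.25, 1_500_000
)
print(line)
```

Using the ticker as a tag rather than a field is a common choice because tags are indexed, which keeps per-company queries (for example, from a Grafana panel) fast.
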
### Grafana Dashboard Guide

> Visualization Dashboard Setup
> A Grafana dashboard setup and InfluxDB connection guide is available at [`Codes/Historical_Data_Analysis/Grafana_Dashboard/`](Codes/Historical_Data_Analysis/Grafana_Dashboard/).

---

## Dataset Used

| Company | Description | Data Range | Dataset Shape | Starting Stock Date | Current Stock Date | Starting Stock Price | Current Stock Price |
|---|---|---|---|---|---|---|---|

## Team Members

| | Madhurima Rawat | Geetanshu Dev Meshram | Sneha Jha |
|---|---|---|---|
| Role | Project Planner & Developer | Data Analyst & Backend Developer | Data Analyst |
| Responsibilities | Project planning, managing GitHub repo, docs, InfluxDB setup, Grafana dashboard, Streamlit & Flask, data viz & preprocessing | Model building for numerical data, Flask app design | Text data processing, model building, hybrid model creation |
| Tools | GitHub, InfluxDB, Grafana, Streamlit, Python, Flask, Pandas, Matplotlib, Plotly | Python, Flask, ML libraries | NLP libraries, ML libraries, hybrid modeling tools |