Setup Guide
Prerequisites:
Core Python
Pyspark
Druid
Kafka
MongoDB
System Requirements:
Java version 1.8.0 or later
Python 3.6
Druid
Kafka
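A quick way to sanity-check part of the list above is a short script. This is only a minimal sketch and not part of the project: the helper names `python_ok` and `tool_on_path` are made up for illustration, it assumes Java is exposed as a `java` executable on PATH, and it does not verify the Java version or whether Druid and Kafka are installed.

```python
import shutil
import sys


def python_ok(min_version=(3, 6)):
    """Return True if the running interpreter is at least min_version."""
    return sys.version_info[:2] >= tuple(min_version)


def tool_on_path(name):
    """Return True if an executable called `name` is found on PATH."""
    return shutil.which(name) is not None


if __name__ == "__main__":
    # Hypothetical check script; the names above are illustrative only.
    print("Python >= 3.6:", python_ok())
    print("java on PATH:", tool_on_path("java"))
```

Run it with the same interpreter you plan to use for the service, since it reports the version of whichever Python executes it.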
How to set up
First, clone the project
git clone https://github.com/shikshalokam/ml-analytics-service.git
Check out the latest version of the repo
Install Python and pip
Add alias python=python3.8 to the ~/.bashrc file
Refresh with the source ~/.bashrc command [Try python -V && pip -V to confirm the installation]
Install the virtual environment package using the pip install virtualenv command or the sudo apt install python3.8-venv command
Create a virtual environment using the python -m venv spark_venv command
Activate the virtual environment using the source spark_venv/bin/activate command
Install the required dependencies using pip install -r requirements.txt (without sudo, so that the packages go into the active virtual environment)
Use the pip list command to recheck that the dependencies are installed correctly
Create a config.ini file and add the config paths (in the ml-analytics dir)
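As an alternative to reading the pip list output by eye, the dependency recheck can be scripted. The following is a minimal sketch, not something shipped with the repository: the function names are hypothetical, it needs Python 3.8 or later for `importlib.metadata`, it assumes a `requirements.txt` in the current directory, and it only checks that each package is present, not that the pinned version matches.

```python
import os
import re
from importlib import metadata  # standard library from Python 3.8


def requirement_names(lines):
    """Extract bare package names from requirements.txt-style lines."""
    names = []
    for line in lines:
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        # Skip blanks, pip options (e.g. "-r other.txt") and URL requirements.
        if not line or line.startswith("-") or "://" in line:
            continue
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.append(match.group(0))
    return names


def missing_requirements(lines):
    """Return the package names that are not installed in this environment."""
    missing = []
    for name in requirement_names(lines):
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    if os.path.exists("requirements.txt"):
        with open("requirements.txt") as handle:
            print("Missing packages:", missing_requirements(handle))
```

Run it with the virtual environment activated; an empty list means every named package was found in that environment.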