1. Doing EDA(Exploratory Data Analysis) for the dataset and data pre-processing if necessary
2. Register and login to Kaggle platform (Kaggle is a free cloud Machine Learning Programming platform for people to program and publishing their solutions based on a given competition dataset) and you will find many published AI solutions on the Titanic survival dataset for that answers the question: “what sorts of people were more likely to survive?” using passenger data (e.g. name, age, gender, socio-economic class, etc). Published solutions can be found at [login to view URL]
3. Critically analysing a number of published solutions on the Kaggle platform to identify one Machine Learning (AI) model that you prefer to apply or your improved solution based on a published solution.
You can download the dataset from [login to view URL]
The training set should be used to build or evaluate the ML/AI models. For the training set, we provide the outcome (also known as the “ground truth”) for each passenger. Your model will be based on “features” like passengers’ gender and class. You can also use feature engineering to create new features.