What's Cooking?
Use recipe ingredients to categorize the cuisine
This task was published on Kaggle Competition. Please read details on original source.
This playground competitions asks you to predict the category of a dish's cuisine given a list of its ingredients.
Data Description
In the dataset, we include the recipe id, the type of cuisine, and the list of ingredients of each recipe (of variable length). The data is stored in JSON format.
An example of a recipe node in train.json:
{
"id": 24717,
"cuisine": "indian",
"ingredients": [
"tumeric",
"vegetable stock",
"tomatoes",
"garam masala",
"naan",
"red lentils",
"red chili peppers",
"onions",
"spinach",
"sweet potatoes"
]
},
File descriptions
- train.json - the training set containing recipes id, type of cuisine, and list of ingredients
- test.json - the test set containing recipes id, and list of ingredients
- sample_submission.csv- a sample submission file in the correct format
Possible Solution
Applying Decision trees for prediction. Apache Spark MLlib provides Decision trees algorithm.