This is a MultiLabelClassification Problem(solved) by Python A JSON file contains a group of labeled/tagged articles. Each article has more than one label/tag. Note: our data may need some cleaning. What we need: A JSON file with a list of label(s)/tag(s) for each article.