Invesco Insights Clustering Analysis

Business Case

Editorial content on our website is tagged poorly on the backend. This hampers analysis and subsequent decision-making. We need to accomplish two things:

Understand the underlying themes within our content
Turn those themes into tags that can be properly applied to content either manually or programmatically (the latter to optimize tag enforcement)

Proposed Solution

Utilize data science, and more specifically, machine learning to algorithmically cluster this content as an exploratory analysis to uncover potiential theme candidates.

Project Status

A code base refactor is currently ongoing that includes the following action items:

Adding doc strings to all functions, methods, and classes for legibility
Building out markdown within the second version of the notebook to better tell the story of this analysis
Optimization of model with feature engineering, data enrichment, and/or regularization
Explaining the mathematical foundation for decision-making
Implementation of K-Means Error to select optimal model run by analyzing total intra cluster variance
Implementing testing via doctests or unit testing is on the wishlist but is not mission critical given the business case (nice to have, not a must have)

There are others that will be added to this list as the project progresses.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
modules		modules
notebooks		notebooks
pdfs		pdfs
public_data/reuters		public_data/reuters
scripts		scripts
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Invesco Insights Clustering Analysis

Business Case

Proposed Solution

Project Status

About

Uh oh!

Releases

Packages

Languages

xvrgill/theme-tag-clustering

Folders and files

Latest commit

History

Repository files navigation

Invesco Insights Clustering Analysis

Business Case

Proposed Solution

Project Status

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages