CourseSyllabiAnalysis

This project extracts the content from computer science university course syllabi, applies NLP processing to the content, connects to ConceptNet for ,apriori association rule mining

Goals

Predict how much academic rigor is require to be successful in a computer science course at East Tennessee State University given the course syllabus and basic characteristics of the course conditions

Determine the impact of dataset expansion via ConceptNet for increasing relationship identification within the dataset

Evaluate how courses are ranked and grouped after supervised prediction, because target values are subjective therefore the results are subjectively interpretable

Steps

Extract content from computer science university course syllabi in a variety of formats with accuracy via PDF Plumber, Pytesseract, and standard text cleaning

Cluster courses to fill in missing target values for the sake of training

Apply NLP processing to identify important terminology in individual documents

Use ConceptNet to expand the corpus content for increasing relationship potential among documents

Apply Apriori association rule mining to quantify relationships among documents within corpus

Construct feature set fit for supervised learning via MLP

Predict "quantity" of academic rigor required to be successful in a computer science course at East Tennessee State University (from a scale from 0 to 100 based on instructor self rankings) through MLP evaluated using Leave One Out Cross Validation (LOOCV) for context preservation

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
0_PDFPlumber_Extractor.ipynb		0_PDFPlumber_Extractor.ipynb
0_Pytesseract_Extractor.ipynb		0_Pytesseract_Extractor.ipynb
1_ComboPDF_Extractor.ipynb		1_ComboPDF_Extractor.ipynb
2_fixAItypo.ipynb		2_fixAItypo.ipynb
3_clusterTargetFillValues.ipynb		3_clusterTargetFillValues.ipynb
4_NgramIdentification.ipynb		4_NgramIdentification.ipynb
5_ConceptNetExpansion.ipynb		5_ConceptNetExpansion.ipynb
6_AprioriNgrams.ipynb		6_AprioriNgrams.ipynb
7_cleanCSV.ipynb		7_cleanCSV.ipynb
8_FeatureBuildExpansion.ipynb		8_FeatureBuildExpansion.ipynb
8_FeatureBuildExpansionMultiply.ipynb		8_FeatureBuildExpansionMultiply.ipynb
8_FeatureBuildNgrams.ipynb		8_FeatureBuildNgrams.ipynb
8_FeatureBuildNgramsMultipy.ipynb		8_FeatureBuildNgramsMultipy.ipynb
9_MLP_Expansion.ipynb		9_MLP_Expansion.ipynb
9_MLP_ExpansionMultiply.ipynb		9_MLP_ExpansionMultiply.ipynb
9_MLP_Ngrams.ipynb		9_MLP_Ngrams.ipynb
9_MLP_NgramsMultiply.ipynb		9_MLP_NgramsMultiply.ipynb
Course Syllabus Analysis NLP Project Final Report.pdf		Course Syllabus Analysis NLP Project Final Report.pdf
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CourseSyllabiAnalysis

Goals

Steps

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

Glodanale/CourseSyllabiAnalysis

Folders and files

Latest commit

History

Repository files navigation

CourseSyllabiAnalysis

Goals

Steps

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages