I'm confused about the corpus proto files used in this project. Are they transformed by original AST JSON files?