Data Availability
Python scripts used in this study can be found on github. Models generated by this study can be found on Hugging Face. TCGA pathology report text can be found on github. CUIMC pathology reports are not available due to HIPAA compliance.
https://github.com/tatonetti-lab/tnm-stage-classifier
https://github.com/tatonetti-lab/tcga-path-reports
https://huggingface.co/jkefeli/CancerStage_Classifier_T