- Crawling all documents from the the 4 courts of appeal, downloading them and converting them from HTML to TXT
- Parsing the docs, to summarize for stats.
To run the application(s), look into the 'scripts' folder
Please note that all documentation is found in the 'doc' folder in this project
About 100K of appeal documents scraped from the NY State Court of appeals are found in S3 here
The (hopefully) latest results of processing, extracted with this CourtDoc regex's are here
- For development I prefer IntelliJ.
- It allows multiple configurations for running and debugging
- Overall better, I can't say where NetBeans would exceed IntelliJ, except in the UI editor