Skip to content

shmsoft/CourtDocs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

83 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CourtDocs

Capabilities

Pulling data from California rehab institutions

Pulling decisions from NY Courts of appeal

  • Crawling all documents from the the 4 courts of appeal, downloading them and converting them from HTML to TXT
  • Parsing the docs, to summarize for stats.

Parsing the text appeal documents, producing a csv file with extracted attributes

To run the application(s), look into the 'scripts' folder

Please note that all documentation is found in the 'doc' folder in this project

Data

About 100K of appeal documents scraped from the NY State Court of appeals are found in S3 here

The (hopefully) latest results of processing, extracted with this CourtDoc regex's are here

Development

  • For development I prefer IntelliJ.
    • It allows multiple configurations for running and debugging
    • Overall better, I can't say where NetBeans would exceed IntelliJ, except in the UI editor

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published