CourtDocs

Capabilities

Pulling data from California rehab institutions

Pulling decisions from NY Courts of appeal

Crawling all documents from the the 4 courts of appeal, downloading them and converting them from HTML to TXT
Parsing the docs, to summarize for stats.

Parsing the text appeal documents, producing a csv file with extracted attributes

To run the application(s), look into the 'scripts' folder

Please note that all documentation is found in the 'doc' folder in this project

Data

About 100K of appeal documents scraped from the NY State Court of appeals are found in S3 here

The (hopefully) latest results of processing, extracted with this CourtDoc regex's are here

Development

For development I prefer IntelliJ.
- It allows multiple configurations for running and debugging
- Overall better, I can't say where NetBeans would exceed IntelliJ, except in the UI editor

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
bin		bin
conf		conf
doc		doc
scripts		scripts
src		src
test-data		test-data
.gitignore		.gitignore
README.md		README.md
changes.txt		changes.txt
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CourtDocs

Capabilities

Pulling data from California rehab institutions

Pulling decisions from NY Courts of appeal

Parsing the text appeal documents, producing a csv file with extracted attributes

Data

Development

About

Releases

Packages

Contributors 2

Languages

shmsoft/CourtDocs

Folders and files

Latest commit

History

Repository files navigation

CourtDocs

Capabilities

Pulling data from California rehab institutions

Pulling decisions from NY Courts of appeal

Parsing the text appeal documents, producing a csv file with extracted attributes

Data

Development

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages