-
-
Notifications
You must be signed in to change notification settings - Fork 115
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Guam: ParserError: Unknown string format: Guam 12, 11 #789
Comments
This appears broken |
The problem is the date regex We are missing all opinions that have that date format. In CL we only have 70 opinions for this court_id, 2 from a not-scraper source, the rest from the scraper from 2021 to the present. I counted the opinions for the years present on CL, and there should be 75, so we are missing 7. I ran the backscraper on the PR with some problems. From 2017 backwards, some records have no docket. For example, the only record with no docket from a year more recent than 2017: From 2008 backwards, most records have no dates so they can't be collected without triggering an error on AbstractSite._check_sanity() |
Solves freelawproject#789 - Validated and improved regexes for date, docket and citation
Sentry Issue: COURTLISTENER-5PX
The text was updated successfully, but these errors were encountered: