This repository has been archived by the owner on Sep 20, 2024. It is now read-only.

Filter ZIP files by contents #111

Open
nightsh opened this issue Apr 10, 2020 · 0 comments

nightsh commented Apr 10, 2020

[STUB]

While scraping, we can download and extract the ZIP files and inspect their contents, so that zipped resources that don't contain data files can be eliminated.

This would likely be part of the Airflow DAGs and would probably take a long time to complete (though still less than doing it manually).
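
As a rough illustration of the check described above, here is a minimal sketch using Python's standard `zipfile` module. It assumes the ZIP has already been downloaded into memory, and it only reads the archive's file listing rather than extracting anything. The function name `zip_contains_data` and the `DATA_EXTENSIONS` set are hypothetical: this issue does not define which file types count as data files.

```python
import io
import zipfile
from pathlib import PurePosixPath

# Assumption: the set of "data file" extensions is not specified in this issue.
DATA_EXTENSIONS = {".csv", ".json", ".xml", ".xls", ".xlsx"}


def zip_contains_data(zip_bytes: bytes, extensions=DATA_EXTENSIONS) -> bool:
    """Return True if at least one archive member has a data-file extension."""
    try:
        with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
            for info in archive.infolist():
                if info.is_dir():
                    continue  # directory entries carry no data themselves
                # ZIP member names always use forward slashes
                if PurePosixPath(info.filename).suffix.lower() in extensions:
                    return True
    except zipfile.BadZipFile:
        # Corrupt or non-ZIP payloads are treated as having no data files
        return False
    return False
```

A scraper step could call `zip_contains_data(payload)` on each downloaded archive and drop the resource when it returns `False`. Matching on file names alone is cheap but can be fooled by mislabeled files; a stricter check would need to open and inspect the members themselves.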

To Be Continued...

nightsh added the future label Apr 10, 2020