datedown package¶

Submodules¶

datedown.dates module¶

Module for getting date lists in different intervals. This only covers the basics like n-hourly, n-daily and dekadal. For the generation of more complex datetime lists a package like pandas can be used.

datedown.dates.daily(start, end)[source]¶

Iterate over list of daily datetime objects.

Parameters:	start (datetime.datetime) – first date yielded end (datetime.datetime) – last date yielded
Yields:	dt (datetime.datetime) – datetime object between start and end in daily steps.

datedown.dates.hourly(start, end)[source]¶

Iterate over list of hourly datetime objects.

Parameters:	start (datetime.datetime) – first date yielded end (datetime.datetime) – last date yielded
Yields:	dt (datetime.datetime) – datetime object between start and end in daily steps.

datedown.dates.n_daily(start, end, n)[source]¶

Iterate over list of n-daily datetime objects.

Parameters:	start (datetime.datetime) – first date yielded end (datetime.datetime) – last date yielded n (int) – step size
Yields:	dt (datetime.datetime) – datetime object between start and end in daily steps.

datedown.dates.n_hourly(start, end, n)[source]¶

Iterate over list of n-hourly datetime objects.

Parameters:	start (datetime.datetime) – first date yielded end (datetime.datetime) – last date yielded n (int) – number of hours between each yielded datetime object.
Yields:	dt (datetime.datetime) – datetime object between start and end in n-hourly steps.

datedown.down module¶

Module that puts the things toghether.

datedown.down.check_downloaded(urls, targets)[source]¶

Check if files that should be downloaded exist. If not then return a list of not downloaded URLs.

Parameters:

urls (iterable) – iterable over url strings
targets (iterable) – paths where to store the files

Returns:

not_urls (list) – list of urls that do not exist locally
not_fnames (list) – list of filenames that do not exist locally

datedown.down.download(urls, targets, num_proc=1, username=None, password=None, recursive=False, filetypes=None)[source]¶

Download the urls and store them at the target filenames.

Parameters:

urls (iterable) – iterable over url strings
targets (iterable) – paths where to store the files
num_proc (int, optional) – Number of parallel downloads to start
username (string, optional) – Username to use for login
password (string, optional) – Password to use for login
recursive (boolean, optional) – If set then no exact filenames can be given. The data will then be downloaded recursively and stored in the target folder.
filetypes (list, optional) – list of file extension to download, any others will no be downloaded

datedown.fname_creator module¶

Module for creating the filenames from the datetimes.

datedown.fname_creator.create_dt_fpath(dt, root, fname, subdirs=[])[source]¶

Create filepaths from root + fname and a list of subdirectories. fname and subdirs will be put through dt.strftime.

Parameters:	dt (datetime.datetime) – date as basis for the URL root (string) – root of the filenpath fname (string) – filename to use subdirs (list, optional) – list of strings. Each element represents a subdirectory. For example the list [‘%Y’, ‘%m’] would lead to a URL of `root/YYYY/MM/fname` or for a dt of datetime(2000,12,31) `root/2000/12/fname`
Returns:	fpath – Full filename including path
Return type:	string

datedown.interface module¶

Interface for the package.

datedown.interface.download_by_dt(dts, url_create_fn, fpath_create_fn, download_fn, passes=3, recursive=False)[source]¶

Download data for datetimes. If files are missing try again passes times.

Parameters:

dts (list) – list of datetime.datetime objects
url_create_fn (function) – function that creates an URL from a datetime object
fpath_create_fn (function) – function that creates a filename from a datetime object
download_fn (function) – function that transfers data from a list of URLs to a list of filenames. Takes two arguments (url_list, fname_list)
passes (int, optional) – if files are missing then try again passes times
recursive (boolean, optional) – If set then no exact filenames can be given. The data will then be downloaded recursively and stored in the target folder. No checking of downloaded files is possible in this case.

datedown.interface.main(args)[source]¶

datedown.interface.main_recursive(args)[source]¶

datedown.interface.mkdate(datestring)[source]¶

datedown.interface.n_hours(intervalstring)[source]¶: Convert an interval string like 1D, 6H etc. to the number of hours it represents.

datedown.interface.parse_args(args)[source]¶

Parse command line parameters

Parameters:	args – command line parameters as list of strings
Returns:	command line parameters as `argparse.Namespace`

datedown.interface.parse_args_recursive(args)[source]¶

Parse command line parameters for recursive download

Parameters:	args – command line parameters as list of strings
Returns:	command line parameters as `argparse.Namespace`

datedown.interface.run()[source]¶

datedown.interface.run_recursive()[source]¶

datedown.urlcreator module¶

Module for creating the URLs from the datetimes.

datedown.urlcreator.create_dt_url(dt, root, fname, subdirs=[])[source]¶

Create URLs from root + fname and a list of subdirectories. fname and subdirs will be put through dt.strftime.

Parameters:	dt (datetime.datetime) – date as basis for the URL root (string) – root of the url fname (string) – filename to use subdirs (list, optional) – list of strings. Each element represents a subdirectory. For example the list [‘%Y’, ‘%m’] would lead to a URL of `root/YYYY/MM/fname` or for a dt of datetime(2000,12,31) `root/2000/12/fname`
Returns:	url
Return type:	string

datedown.wget module¶

Interface to wget command line utility.

datedown.wget.download(url, target, username=None, password=None, cookie_file=None, recursive=False, filetypes=None)[source]¶

Download a url using wget. Retry as often as necessary and store cookies if authentification is necessary.

Parameters:

url (string) – URL to download
target (string) – path on local filesystem where to store the downloaded file
username (string, optional) – username
password (string, optional) – password
cookie_file (string, optional) – file where to store cookies
recursive (boolean, optional) – If set then no exact filenames can be given. The data will then be downloaded recursively and stored in the target folder.
filetypes (list, optional) – list of file extension to download, any others will no be downloaded

datedown.wget.map_download(url_target, username=None, password=None, cookie_file=None, recursive=False, filetypes=None)[source]¶

variant of the function that only takes one argument. Otherwise map_async of the multiprocessing module can not work with the function.

Parameters:

url_target (list) – first element the url, second the target string
username (string, optional) – username
password (string, optional) – password
cookie_file (string, optional) – file where to store cookies
recursive (boolean, optional) – If set then no exact filenames can be given. The data will then be downloaded recursively and stored in the target folder.
filetypes (list, optional) – list of file extension to download, any others will no be downloaded

datedown package¶

Submodules¶

datedown.dates module¶

datedown.down module¶

datedown.fname_creator module¶

datedown.interface module¶

datedown.urlcreator module¶

datedown.wget module¶

Module contents¶

Table Of Contents

Related Topics

This Page