Get the URL of an associated file. The get_url_* functions return a URL as a string, which can then be passed to other functions such as curl::curl_download().
get_url(
  file,
  dataset = NULL,
  format = c("original", "bundle"),
  key = Sys.getenv("DATAVERSE_KEY"),
  server = Sys.getenv("DATAVERSE_SERVER"),
  original = TRUE,
  ...
)

get_url_by_name(
  filename,
  dataset,
  format = c("original", "bundle"),
  key = Sys.getenv("DATAVERSE_KEY"),
  server = Sys.getenv("DATAVERSE_SERVER"),
  original = TRUE,
  ...
)

get_url_by_id(
  fileid,
  dataset = NULL,
  format = c("original", "bundle"),
  key = Sys.getenv("DATAVERSE_KEY"),
  server = Sys.getenv("DATAVERSE_SERVER"),
  original = TRUE,
  ...
)

get_url_by_doi(
  filedoi,
  dataset = NULL,
  format = c("original", "bundle"),
  key = Sys.getenv("DATAVERSE_KEY"),
  server = Sys.getenv("DATAVERSE_SERVER"),
  original = TRUE,
  ...
)
file
An integer specifying a file identifier; or a vector of integers specifying file identifiers; or, if used with the prefix "doi:", a character with the file-specific DOI; or, if used without the prefix, a filename accompanied by a dataset DOI in the dataset argument, or an object of class “dataverse_file” as returned by dataset_files. Can be a vector for multiple files.
dataset
A character specifying a persistent identification ID for a dataset, for example "10.70122/FK2/HXJVJU". Alternatively, an object of class “dataverse_dataset” obtained by dataverse_contents().
format
A character string specifying a file format for download. By default, this is “original” (the original file format). If NULL, no query is added, so ingested files are returned in their ingested TSV form. For tabular datasets, the option “bundle” downloads the bundle of the original and archival versions, as well as the documentation. See https://guides.dataverse.org/en/latest/api/dataaccess.html for details.
key
A character string specifying a Dataverse server API key. If one is not specified, functions calling authenticated API endpoints will fail. Keys can be specified atomically (per function call) or globally using Sys.setenv("DATAVERSE_KEY" = "examplekey").
server
A character string specifying a Dataverse server. Multiple Dataverse installations exist, with "dataverse.harvard.edu" being the most prominent. The server can be defined each time within a function, or it can be set as a default via an environment variable. To set a default, run Sys.setenv("DATAVERSE_SERVER" = "dataverse.harvard.edu") or add DATAVERSE_SERVER = "dataverse.harvard.edu" to one's .Renviron file (usethis::edit_r_environ()), with the appropriate domain as its value.
original
A logical, defaulting to TRUE. If an ingested (.tab) version is available, should the original version be downloaded instead of the ingested one? If there is no ingested version, this is set to NA. Note that in get_dataframe_*, original is set to FALSE by default. Either can be changed.
...
Additional arguments passed to an HTTP request function, such as GET, POST, or DELETE. See use_cache for details on how the R dataverse package uses disk and session caches to improve network performance.
filename
Filename of the file within the dataset, with the file extension as shown in Dataverse (for example, if nlsw88.dta was the original but is displayed as the ingested nlsw88.tab, use the ingested version, nlsw88.tab).
fileid
A numeric ID internally used for get_file_by_id. Can be a vector for multiple files.
filedoi
A DOI for a single file (not the entire dataset), of the form "10.70122/FK2/PPIAXE/MHDB0O" or "doi:10.70122/FK2/PPIAXE/MHDB0O". Can be a vector for multiple files.
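The key and server defaults described above are read from environment variables, so they can be set once per session instead of being passed on every call. A minimal sketch using only base R follows; the server value is the demo installation used in the examples on this page, and "examplekey" is a placeholder rather than a real API key.

```r
# Set session-wide defaults (placeholder values; substitute your own).
Sys.setenv("DATAVERSE_SERVER" = "demo.dataverse.org")
Sys.setenv("DATAVERSE_KEY" = "examplekey")

# Read the values back; these are what the `server` and `key`
# argument defaults shown in the usage above will evaluate to.
server <- Sys.getenv("DATAVERSE_SERVER")
key <- Sys.getenv("DATAVERSE_KEY")
print(server)
```

Values set with Sys.setenv() last only for the current R session; the .Renviron approach makes them persist across sessions.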
A string or a list of strings that are URLs.
This function does not download the associated data. In contrast, get_dataframe() downloads the requested file to a tempfile and then uses R to read it, and get_file(..., return_url = FALSE) reads the binary file into R's memory with httr::GET(). get_url() simply returns the URL for download.
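To make the shape of the returned string concrete, here is a rough sketch that assembles URLs matching the ones shown in the examples on this page. The helper sketch_access_url() is hypothetical and is not part of the dataverse package; it assumes the file ID is already known, whereas the real functions look it up from the server, and it only covers the original versus ingested distinction.

```r
# Hypothetical helper (NOT part of the dataverse package): builds a
# Data Access API URL of the shape shown in the examples on this page.
sketch_access_url <- function(fileid, server, original = TRUE) {
  # format(..., scientific = FALSE) keeps large IDs out of e-notation.
  id <- format(fileid, scientific = FALSE)
  url <- paste0("https://", server, "/api/access/datafile/", id)
  # Assumption: asking for the original file appends ?format=original;
  # otherwise no query is added and the ingested form is served.
  if (isTRUE(original)) {
    url <- paste0(url, "?format=original")
  }
  url
}

url_original <- sketch_access_url(1734017, "demo.dataverse.org")
url_ingested <- sketch_access_url(1734017, "demo.dataverse.org",
                                  original = FALSE)
print(url_original)
print(url_ingested)
```

Because the result is a plain character string, nothing is fetched until the URL is handed to a downloader such as curl::curl_download().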
if (FALSE) { # \dontrun{
  # Get URLs
  get_url_by_name(
    filename = "nlsw88.tab",
    dataset = "10.70122/FK2/PPIAXE",
    server = "demo.dataverse.org"
  )
  # https://demo.dataverse.org/api/access/datafile/1734017?format=original

  # For ingested, tab-delimited files
  get_url_by_name(
    filename = "nlsw88.tab",
    dataset = "10.70122/FK2/PPIAXE",
    original = FALSE,
    server = "demo.dataverse.org"
  )
  # https://demo.dataverse.org/api/access/datafile/1734017

  # To download to a local directory
  curl::curl_download(
    "https://demo.dataverse.org/api/access/datafile/1734017?format=original",
    destfile = "nlsw88.dta"
  )
} # }