Method
createDatasetVersionFromFileConnector POST
Copy POST

Creates a new version of the specified dataset.

Arguments:

REQUIRED KEY TYPE DESCRIPTION
Yes datasetId str Unique string identifier associated with the dataset.
No location str External URI to import the dataset from. If not specified, the last location will be used.
No fileFormat str File format to be used. If not specified, the service will try to detect the file format.
No csvDelimiter str If the file format is CSV, use a specific CSV delimiter.
No mergeFileSchemas bool Signifies if the merge file schema policy is enabled.
No parsingConfig ParsingConfig Custom config for dataset parsing.
KEY TYPE Description
csvDelimiter str Delimiter for CSV files. Defaults to None.
escape str Escape character for CSV files. Defaults to '"'.
filePathWithSchema str Path to the file with schema. Defaults to None.
No sqlQuery str The SQL query to use when fetching data from the specified location. Use `__TABLE__` as a placeholder for the table name. For example: "SELECT * FROM __TABLE__ WHERE event_date > '2021-01-01'". If not provided, the entire dataset from the specified location will be imported.
Note: The arguments for the API methods follow camelCase but for Python SDK underscore_case is followed.

Response:

KEY TYPE DESCRIPTION
success Boolean true if the call succeeded, false if there was an error
result DatasetVersion
KEY TYPE Description
datasetVersion str The unique identifier of the dataset version.
status str The current status of the dataset version
datasetId str A reference to the Dataset this dataset version belongs to.
size int The size in bytes of the file.
rowCount int Number of rows in the dataset version.
fileInspectMetadata dict Metadata information about file's inspection. For example - the detected delimiter for CSV files.
createdAt str The timestamp this dataset version was created.
error str If status is FAILED, this field will be populated with an error.
incrementalQueriedAt str If the dataset version is from an incremental dataset, this is the last entry of timestamp column when the dataset version was created.
uploadId str If the dataset version is being uploaded, this the reference to the Upload
mergeFileSchemas bool If the merge file schemas policy is enabled.
databaseConnectorConfig dict The database connector query used to retrieve data for this version.
applicationConnectorConfig dict The application connector used to retrieve data for this version.
invalidRecords str Invalid records in the dataset version

Exceptions:

TYPE WHEN
DataNotFoundError

datasetId is not found.

InvalidEnumParameterError

An invalid value is passed for fileFormat.

InvalidParameterError

The location is not a valid cloud location URI.

PermissionDeniedError

The location has not been verified with Abacus.AI.

DataNotFoundError

No file was found at the specified location.

Language: