Dataset

dataset_client.create_dataset(body)

Create a new dataset with the specified name, description, and other details.

Table: Input parameters summary

| Field | Input | Type | Description |
| --- | --- | --- | --- |
| dataset_name | body | str | The name of the new dataset to create. |
| dataset_description | body | str | A description for the new dataset. |
| dataset_type | body | str | The type of the new dataset (e.g., Image, Text, Audio). |
| item_format | body | str | The format of the dataset items (e.g., image/jpeg, text/plain, audio/mpeg). |
| metadata | body | dict | A dictionary of metadata to associate with the new dataset. |
| s3IntermediateUrl | body | str | The S3 URL where the dataset items are processed. |
| aws_credentials_enabled | body | bool | Whether AWS credentials are enabled for the dataset. |
| aws_credentials | body | dict | A dictionary containing AWS credentials for the dataset. |
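
The fields listed above are the keys of the `body` dictionary. As a minimal sketch, the hypothetical helper below (`build_dataset_body` is not part of the Tensoract client) assembles such a dictionary from keyword arguments. It includes `aws_credentials` only when credentials are supplied; whether the API accepts a body without that key is an assumption, not something this reference states.

```python
from typing import Any, Dict, Optional


def build_dataset_body(
    dataset_name: str,
    dataset_type: str,
    item_format: str,
    s3_intermediate_url: str,
    dataset_description: str = "",
    metadata: Optional[Dict[str, Any]] = None,
    aws_credentials: Optional[Dict[str, str]] = None,
) -> Dict[str, Any]:
    """Assemble a create_dataset body using the documented field names.

    Hypothetical helper for illustration; not part of the Tensoract client.
    """
    body: Dict[str, Any] = {
        "dataset_name": dataset_name,
        "dataset_description": dataset_description,
        "dataset_type": dataset_type,
        "item_format": item_format,
        "metadata": metadata or {},
        "s3IntermediateUrl": s3_intermediate_url,
        # The flag is set only when credentials are actually supplied.
        "aws_credentials_enabled": aws_credentials is not None,
    }
    if aws_credentials is not None:
        # Assumption: the key may be left out when credentials are disabled.
        body["aws_credentials"] = dict(aws_credentials)
    return body
```

The returned dictionary can then be passed to `dataset_client.create_dataset(body)`.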

Sample Request Body

from pprint import pprint
from tensoract.client import Tensoract


# Initialize Tensoract client
dataset_client = Tensoract(api_url="YOUR_API_URL", api_key="YOUR_API_KEY")

# Request body describing the new dataset
body = {
    "dataset_name": "Test-Dataset-API",
    "dataset_description": "This is a test Dataset",
    "dataset_type": "Image",
    "item_format": "image/tiff",
    "metadata": {
        "Dataset": "Image Dataset"
    },
    "s3IntermediateUrl": "s3://************************************",
    "aws_credentials_enabled": True,
    "aws_credentials": {
        "awsKey": "***********",
        "awsSecret": "**************",
        "awsRegion": "***********",
        "awsType": "key"
    }
}

# Invoke create_dataset method
pprint(dataset_client.create_dataset(body))

Table: Response summary

| Field | Type | Description |
| --- | --- | --- |
| dataset_id | str | The ID of the dataset. |
| value | str | A message describing the result. |
| status | str | The status of the operation. |

Sample Response

{
    "dataset_id": "e20859abab70f7df3fb0b7c7",
    "value": "Dataset updated",
    "status": "success"
}
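
Because the response carries both a `status` and the new `dataset_id`, a caller will usually want to check one before using the other. The sketch below is a hypothetical helper (not part of the Tensoract client) and assumes that any `status` other than `"success"` indicates a failure, which this reference does not confirm.

```python
from typing import Any, Dict


def extract_dataset_id(response: Dict[str, Any]) -> str:
    """Return dataset_id from a create_dataset response, or raise on failure.

    Hypothetical helper; assumes "success" is the only success status.
    """
    if response.get("status") != "success":
        # Surface the server's message (the "value" field) in the error.
        raise RuntimeError(f"create_dataset failed: {response.get('value')}")
    return response["dataset_id"]
```

With the sample response above, `extract_dataset_id(response)` returns `"e20859abab70f7df3fb0b7c7"`.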

dataset_client.delete_dataset(dataset_id)

Delete a dataset by ID.

Table: Input parameters summary

| Field | Input | Type | Description |
| --- | --- | --- | --- |
| dataset_id | body | str | The ID of the dataset to delete. |

from pprint import pprint
from tensoract.client import Tensoract


# Initialize Tensoract client
dataset_client = Tensoract(api_url="YOUR_API_URL", api_key="YOUR_API_KEY")

# Invoke delete_dataset method
response = dataset_client.delete_dataset(dataset_id="8bc05ec26ab161e218a0e842")
pprint(response)

Table: Response summary

| Field | Type | Description |
| --- | --- | --- |
| status | str | The status of the operation. |
| value | str | The result or value of the operation. |

Sample Response

{
     "status": "success",
     "value": "dataset deleted"
}

dataset_client.get_dataset(dataset_id)

Get a dataset by ID.

Table: Input parameters summary

| Field | Input | Type | Description |
| --- | --- | --- | --- |
| dataset_id | body | str | The ID of the dataset to retrieve. |

from pprint import pprint
from tensoract.client import Tensoract


# Initialize Tensoract client
dataset_client = Tensoract(api_url="YOUR_API_URL", api_key="YOUR_API_KEY")

# Invoke get_dataset method
response = dataset_client.get_dataset(dataset_id="9373f3f9d3f1400cfba6ac6a")
pprint(response)

Table: Response summary

| Field | Type | Description |
| --- | --- | --- |
| dataset_id | str | The ID of the dataset. |
| dataset_name | str | The name of the dataset. |
| dataset_description | str | A description for the dataset. |
| dataset_type | str | The type of the dataset. |
| dataset_versions | list | A list of dictionaries containing version information for the dataset. |
| item_format | str | The format of the dataset items. |
| metadata | dict | A dictionary containing metadata for the dataset. |
| aws_credentials_enabled | bool | Whether AWS credentials are enabled for the dataset. |
| aws_credentials | dict | A dictionary containing AWS credentials for the dataset. |
| s3IntermediateUrl | str | The S3 URL for intermediate data. |

Sample Response

{
    "dataset_id": "9373f3f9d3f1400cfba6ac6a",
    "dataset_name": "Test-Dataset-Recipes",
    "dataset_type": "PDFOCR",
    "dataset_versions": [
        {
            "id": 0,
            "items": 29,
            "name": "V.1",
            "locked": false,
            "created": 1690173205,
            "modified": 1690173205
        }
    ],
    "dataset_items": 29,
    "metadata": {
        "Dataset": "Recipes"
    },
    "company": "60f38299446de8dabe9207e5",
    "company_name": "Objectways",
    "aws_credentials_enabled": true,
    "aws_credentials": {
        "awsType": "key",
        "awsKey": "********************",
        "awsSecret": "****************************************",
        "awsRegion": "*********"
    },
    "s3IntermediateUrl": "*******************************"
}
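
The `get_dataset` response includes an `aws_credentials` dictionary. The sample shows masked values, but this reference does not say whether the API always masks them, so a cautious caller may prefer to redact them before logging. A minimal sketch, using a hypothetical `redact_credentials` helper that is not part of the Tensoract client:

```python
import copy
from typing import Any, Dict

# Keys inside aws_credentials that are treated as secret in this sketch.
SENSITIVE_KEYS = ("awsKey", "awsSecret")


def redact_credentials(dataset: Dict[str, Any]) -> Dict[str, Any]:
    """Return a deep copy of a dataset dict with AWS secrets masked.

    Hypothetical helper; the input dictionary is left unchanged.
    """
    redacted = copy.deepcopy(dataset)
    credentials = redacted.get("aws_credentials")
    if isinstance(credentials, dict):
        for key in SENSITIVE_KEYS:
            if key in credentials:
                credentials[key] = "***"
    return redacted
```

For example, `pprint(redact_credentials(response))` prints the dataset without exposing the key or secret, while `response` itself is untouched.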

dataset_client.list_datasets(dataset_id, dataset_name, active)

List datasets, optionally filtered by ID, name, or active status.

Table: Input parameters summary

| Field | Input | Type | Description |
| --- | --- | --- | --- |
| dataset_id | body | str | The ID of the dataset. |
| dataset_name | body | str | The name of the dataset. |
| active | body | bool | Whether to filter datasets by active status. |

Sample Request Body

from pprint import pprint
from tensoract.client import Tensoract


# Initialize Tensoract client
dataset_client = Tensoract(api_url="YOUR_API_URL", api_key="YOUR_API_KEY")

# Invoke list_datasets method
response = dataset_client.list_datasets(dataset_id=None, dataset_name=None, active=True)
pprint(response)

Table: Response summary

| Field | Type | Description |
| --- | --- | --- |
| dataset_id | str | The ID of the dataset. |
| dataset_name | str | The name of the dataset. |
| dataset_description | str | A description for the dataset. |
| dataset_type | str | The type of the dataset. |
| dataset_versions | list | A list of dictionaries containing version information for the dataset. |
| company | str | The ID of the company associated with the item. |
| company_name | str | The name of the company associated with the item. |

Sample Response

[
    {
        "dataset_id": "c8a04fbf7c8f95ef8912338f",
        "dataset_name": "Test Dataset",
        "dataset_type": "Image",
        "dataset_versions": [
            {
                "id": 0,
                "items": 1,
                "name": "Base version",
                "locked": true,
                "created": 1657526913,
                "modified": 1657526913
            },
            {
                "id": 1,
                "items": 59,
                "name": "Version 2",
                "locked": true,
                "created": 1657527052,
                "modified": 1657527052
            }
        ],
        "company": "60f38299446de8dabe9207e5",
        "company_name": "Objectways"
    },
    {
        "dataset_id": "bd35da16cd39283f119454cc",
        "dataset_name": "Test Dataset 3",
        "dataset_type": "Image",
        "dataset_versions": [
            {
                "id": 0,
                "items": 1,
                "name": "V.1",
                "locked": true,
                "created": 1657708086,
                "modified": 1657708086
            },
            {
                "id": 1,
                "items": 5,
                "name": "V.2",
                "locked": true,
                "created": 1657709319,
                "modified": 1657709319
            }
        ],
        "company": "60f38299446de8dabe9207e5",
        "company_name": "Objectways"
    }
]
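
Each entry in the list carries its own `dataset_versions`, so a common follow-up step is to reduce the response to one row per dataset. The sketch below rests on two assumptions that this reference does not confirm: that the version with the highest `id` is the most recent one, and that `created` and `modified` are Unix timestamps in seconds. Both function names are hypothetical and not part of the Tensoract client.

```python
from datetime import datetime, timezone
from typing import Any, Dict, List, Optional


def latest_version(dataset: Dict[str, Any]) -> Optional[Dict[str, Any]]:
    """Return the version with the highest id, or None if there are none."""
    versions = dataset.get("dataset_versions") or []
    if not versions:
        return None
    return max(versions, key=lambda version: version["id"])


def summarize(datasets: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Build one summary row per dataset from a list_datasets response."""
    rows = []
    for dataset in datasets:
        version = latest_version(dataset)
        rows.append({
            "dataset_id": dataset["dataset_id"],
            "dataset_name": dataset["dataset_name"],
            "latest_version": version["name"] if version else None,
            "items": version["items"] if version else 0,
            # Assumption: "modified" is a Unix timestamp in seconds (UTC).
            "modified": (
                datetime.fromtimestamp(version["modified"], tz=timezone.utc)
                if version else None
            ),
        })
    return rows
```

Calling `summarize(response)` on the sample response above yields two rows, one for "Test Dataset" and one for "Test Dataset 3".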

dataset_client.update_dataset(dataset_id, body)

Update an existing dataset.

Table: Input parameters summary

| Field | Input | Type | Description |
| --- | --- | --- | --- |
| dataset_id | body | str | The ID of the dataset. |
| dataset_name | body | str | The name of the dataset. |
| dataset_description | body | str | A description for the dataset. |
| dataset_type | body | str | The type of the dataset. |
| dataset_versions | body | list | A list of dictionaries containing version information for the dataset. |
| item_format | body | str | The format of the dataset items. |
| metadata | body | dict | A dictionary containing metadata for the dataset. |
| aws_credentials_enabled | body | bool | Whether AWS credentials are enabled for the dataset. |
| aws_credentials | body | dict | A dictionary containing AWS credentials for the dataset. |
| s3IntermediateUrl | body | str | The S3 URL for intermediate data. |

Sample Request Body

from pprint import pprint
from tensoract.client import Tensoract


# Initialize Tensoract client
dataset_client = Tensoract(api_url="YOUR_API_URL", api_key="YOUR_API_KEY")

# Request body with the updated dataset fields
body = {
    "dataset_name": "Test-Dataset-API",
    "dataset_description": "Update dataset",
    "dataset_type": "Image",
    "item_format": "image/tiff",
    "metadata": {
        "Dataset": "Image Dataset"
    },
    "s3IntermediateUrl": "s3://************************************",
    "aws_credentials_enabled": True,
    "aws_credentials": {
        "awsKey": "***********",
        "awsSecret": "**************",
        "awsRegion": "***********",
        "awsType": "key"
    }
}

# Invoke update_dataset method
response = dataset_client.update_dataset(dataset_id="85bfac4fb66d935312a373ec", body=body)
pprint(response)

Table: Response summary

| Field | Type | Description |
| --- | --- | --- |
| dataset_id | str | The ID of the dataset. |
| dataset_name | str | The name of the dataset. |
| dataset_description | str | A description for the dataset. |
| dataset_type | str | The type of the dataset. |
| dataset_versions | list | A list of dictionaries containing version information for the dataset. |
| item_format | str | The format of the dataset items. |
| metadata | dict | A dictionary containing metadata for the dataset. |
| aws_credentials_enabled | bool | Whether AWS credentials are enabled for the dataset. |
| aws_credentials | dict | A dictionary containing AWS credentials for the dataset. |
| s3IntermediateUrl | str | The S3 URL for intermediate data. |

Sample Response

{
    "dataset_id": "85bfac4fb66d935312a373ec",
    "dataset_name": "Test-Dataset-API",
    "dataset_type": "Image",
    "dataset_description": "Update Dataset",
    "dataset_versions": [
        {
            "id": 0,
            "items": 0,
            "name": "V.1",
            "locked": false,
            "created": 1678442477,
            "modified": 1678442477
        }
    ],
    "item_format": "image/tiff",
    "metadata": {
        "Dataset": "Image Dataset"
    },
    "company": "60f38299446de8dabe9207e5",
    "aws_credentials_enabled": true,
    "aws_credentials": {
        "awsKey": "********************",
        "awsSecret": "****************************************",
        "awsRegion": "*********",
        "awsType": "key"
    },
    "s3IntermediateUrl": "s3://************************************"
}
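
Since the update response echoes the dataset's fields, it can be compared against the request body to confirm that the changes were applied. The following is a minimal sketch with a hypothetical `unapplied_fields` helper (not part of the Tensoract client). It skips `aws_credentials` by default, on the assumption that credentials may come back masked and would never compare equal.

```python
from typing import Any, Dict, Iterable


def unapplied_fields(
    body: Dict[str, Any],
    response: Dict[str, Any],
    ignore: Iterable[str] = ("aws_credentials",),
) -> Dict[str, Any]:
    """Return the body fields whose value is missing or different in the response.

    Hypothetical helper; keys listed in `ignore` are not compared.
    """
    skipped = set(ignore)
    missing = object()  # sentinel that never equals a real value
    return {
        key: value
        for key, value in body.items()
        if key not in skipped and response.get(key, missing) != value
    }
```

An empty result means every compared field in the response matches what was sent; the comparison is exact, so it is also case-sensitive for strings.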