

Label for Image Captioning


A Text schema element stores its data as a string.
Text labels enable image captioning and other generic use cases.
Architectures such as multi-modal attention models take text as input. Common use cases include producing image captions for accessibility.

Define a Text Label

Plainsight does not support model training for this label type.
  1. Navigate to the "Label Definitions" tab of your dataset.
  2. Type the label name in the "Name" field.
  3. Choose Text from the "Type" dropdown.
  4. Select the desired text type from the "Input Type" dropdown.
  5. Click the "Save" button.
Supported Input Text Types:
  • email
  • number
  • tel
  • url
After you've defined a Text label, you can start using it to label your images.
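
The supported input types listed above share their names with the standard HTML input types. How Plainsight validates each type is not described here, so the following is only a minimal Python sketch of how you might sanity-check text values on your own side before or after labeling. The function name `is_valid_text` and both regular expressions are hypothetical and deliberately loose; they are not Plainsight's rules.

```python
import re
from urllib.parse import urlparse

# Hypothetical, intentionally permissive patterns (assumptions, not Plainsight's rules).
_EMAIL_RE = re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+")
_TEL_RE = re.compile(r"\+?[0-9][0-9\s().-]*")


def is_valid_text(value: str, input_type: str) -> bool:
    """Return True if `value` loosely matches the given input type."""
    if input_type == "email":
        return _EMAIL_RE.fullmatch(value) is not None
    if input_type == "number":
        try:
            float(value)  # accepts integers, decimals, and exponent notation
        except ValueError:
            return False
        return True
    if input_type == "tel":
        return _TEL_RE.fullmatch(value) is not None
    if input_type == "url":
        parsed = urlparse(value)
        # Require both a scheme (e.g. https) and a host.
        return bool(parsed.scheme and parsed.netloc)
    raise ValueError(f"Unsupported input type: {input_type}")
```

For example, `is_valid_text("https://example.com/x", "url")` passes, while a bare `example.com` fails because it has no scheme.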

Data Format

The Text label creates an array of objects, each with a "data" field that holds the text string. The fields of one such object are:
"type": "text",
"data": "A girl flying a kite on the beach",
"children": {}
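
As a rough illustration of reading this format, the sketch below pulls the caption strings out of a list of annotation objects. It assumes the fragment above sits inside a JSON object and that the objects arrive as a JSON array; the surrounding structure of a real export may differ, and the helper name `extract_captions` is hypothetical.

```python
import json

# Assumed wrapper: the fields shown above, enclosed in an object inside an array.
raw = """
[
  {"type": "text", "data": "A girl flying a kite on the beach", "children": {}}
]
"""


def extract_captions(annotations):
    """Collect the string payload of every annotation whose type is "text"."""
    return [
        ann["data"]
        for ann in annotations
        if ann.get("type") == "text" and isinstance(ann.get("data"), str)
    ]


annotations = json.loads(raw)
captions = extract_captions(annotations)
print(captions)
```

Filtering on "type" keeps the helper safe to run over annotation lists that also contain other label types.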

Annotations with Text Labels

Once you've defined a Text label, you can begin annotating your data.

Training with Text Annotations

Plainsight doesn't currently support SmartML model training for text annotations. Text annotations can be exported in the Plainsight format and used to train models outside of Plainsight.
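
As one possible way to prepare exported text annotations for training elsewhere, the sketch below turns them into image/caption pairs and serializes them as JSON Lines, a layout many captioning pipelines can read. The export layout used here (image name mapped to a list of annotation objects) and the file names are assumptions made for illustration only; check the structure of your actual Plainsight export before adapting it.

```python
import json

# Hypothetical export layout: image name -> list of annotation objects.
export = {
    "beach_001.jpg": [
        {"type": "text", "data": "A girl flying a kite on the beach", "children": {}},
    ],
    "street_002.jpg": [],  # an image with no text annotation yet
}


def to_caption_pairs(export):
    """Flatten the export into one {"image", "caption"} record per text annotation."""
    pairs = []
    for image_name, annotations in export.items():
        for ann in annotations:
            if ann.get("type") == "text" and ann.get("data"):
                pairs.append({"image": image_name, "caption": ann["data"]})
    return pairs


pairs = to_caption_pairs(export)
# One JSON object per line, ready to write to a .jsonl file.
jsonl = "\n".join(json.dumps(pair) for pair in pairs)
print(jsonl)
```

Images without a text annotation are skipped, so the resulting file contains only usable training pairs.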