Label projects fall under three broad categories: token-based, row-based, and document-based. We'll take a look at examples of each below.
In token-based labeling, the labeling process can be done by labeling tokens or spans of tokens. Token-based labeling is well-suited for projects such as NER and POS. Here are the things that are important for you to know before labeling your project.
The label box will appear when you click on the tokens. You can click manually on the labels or use the corresponding keyboard shortcuts by typing "1", "2", "3", or "4".
Due to a limited number of numerals on the keyboard, keyboard shortcuts are only available for the first 9 labels.
You can search for labels in the label box by starting to type out parts of the label. In the following example, you could type "NN" followed by "3" to apply the NNS label.
It is possible to apply overlapping labels, or even multiple labels to the same token or span.
The first way is selecting the token or span, click shift
+ the appropriate label.
The second way is to use keyboard shortcuts. Select the span, use up
and down
to find the right label, then press shift
+ Enter
.
You can Edit the sentence by right-click on the row then choose Edit from the pop up menu shown. When editing, we will show you the original sentence. Please take a note that we will tokenize the sentence at the server. To apply changes you can do one of these:
Press enter
after editing or click anywhere outside the edit text area to save the changes.
Press ctrl+enter
if you want to use space as token separator and not using default tokenizer that Datasaur has.
You can add new lines by right-clicking on the row then choosing Insert Line Above or Insert Line Below.
You can delete lines by right-clicking on the row then choosing Delete Line.
You can delete all the labels on a given sentence by right-clicking anywhere in the sentence and choosing Delete Sentence Labels.
Once you have labeled tokens, you can draw arrows between labels.
You can even apply labels to the arrows themselves. In order to do so, double-click the arrow and select the appropriate label.
You can also reverse arrows, delete arrows, and delete labels by right-clicking on the arrow.
You can move to the desired line via the Go menu.
Go to Start will take you to the first line.
Go to End will take you to the last line.
Go to Line will take you to a specific line.
Go to Next Unlabeled Token will take you to the next unlabeled token.
Go to Previous Unlabeled Token will take you to the previous unlabeled token.
Go to Next Unlabeled Line will take you to the next unlabeled line.
Go to Previous Unlabeled Line will take you to the previous unlabeled line.
Go to Next File will take you to the next file.
Go to Previous File will take you to the previous file.
Deleting labels can be done in two ways:
Right-clicking the label and clicking on Delete label.
Click Delete
or Backspace
on your keyboard.
Paragraph/sentence labeling optimizes the interface for when you are applying labels to longer sentences or entire paragraphs. You will have the option to show the label as an index bar on the left-hand side, and hide the label above the text to avoid clutter.
You can enable this by altering the project settings in token-based projects:
Click File on the top left, then click Settings --> General.
Check Show index bar for labels.
Select Show labels only when token is clicked (optional).
Character-based labeling allows you to select and apply labels on a character-level basis, so you don't have to select the entire token.
Click File on the top left, then click Settings --> Project.
Check Allow character-based labeling.
Labeling the character can be done in two ways:
Select the desired character using your mouse.
Select the character using keyboard shortcuts shift
+ right
.
If you want to label the entire sentence, you can simply click on the line number.
Select multiple lines at once can be done by holding Shift
+ clicking the desired line number.
​
Once you have finished labeling, click Mark document as complete. This will signify to your team you are done with the document and it is ready for Review or Export.
If you choose row-based or document-labeling as the task type, the goal of labeling is to answer the questions. You can answer the questions in the Document Labeling extension on the right side.
You can navigate to the next question by using your mouse or typing Tab
on the keyboard.
You can move to the desired row via the Go menu.
Go to Start will take you to the first row.
Go to End will take you to the last row.
Go to Line will take you to a specific row.
Go to Next Unlabeled Line will take you to the next unlabeled line.
Go to Previous Unlabeled Line will take you to the previous unlabeled line.
Go to Next File will take you to the next file.
Go to Previous File will take you to the previous file.
The asterisk (*) next to the question indicates that the question requires an answer - leaving a required field blank will trigger an error.
If you create Text Field, Text Area, and Date type questions, you are able to sort and filter the columns.
For the Text Field and Text Area columns, you can filter by searching the keyword.
For the Date column, you can filter the date range.
When the answer type is Dropdown, keyboard shortcuts are displayed in the extension. In the example below, you can click "1" on your keyboard to apply Fiction
as an answer.
You are allowed to see all rows or the unlabeled rows by clicking the View menu. This feature will help you if your project has many rows.
You can hide and rename headers by right-clicking on the header.
Once you have finished labeling, click Mark document as complete.
If you have already created a project, you can change the configurations through Settings.
You can click the File menu and choose Settings.
The General tab allows you to show index bar for labels. This is recommended if you want to label multiple lines or paragraphs.
The Project tab allows you to change labeling settings for the task.
The Administrator tab allows you to change the assigned labelers and settings related to your project. This is only available for projects created in Datasaur Teams.
The Project tab allows you to change the number of rows displayed per page and configure how media should be expanded.
The Administrator tab allows you to change the assigned labelers and settings related to your project. This is only available for projects created in Datasaur Teams.
Datasaur has 5 team roles. You will find these roles in team management.
You will automatically become an admin when you create a team. As an admin, you are allowed to invite team members, create projects, assign team members as labelers or reviewers, and promote team members as admins. You can also access the high-level overview of your team's projects and progress through the Overview page.
We divide reviewers into team scope and project scope. If the admin assigns you as a team reviewer, you are able to see all projects and review them.
A project reviewer can be assigned in Step 4 of the project creation wizard. As a project reviewer, you will only see and review the projects that are assigned to you.
The most common role, anyone an admin invites to a team is assigned as a labeler. As a labeler, you will only see the projects that are assigned to you.
If you're a team admin, you can also be a labeler. This role will automatically selected if you assign yourself in Step 4.