Creating a Project
After signing in, you are automatically directed to your personal workspace, where you can see the Project shortcuts and the list of Projects that you are working on.
To create a Project, click the Custom project button or one of the Project Template shortcuts. This article walks through creating a custom Project; the next article describes each of the Project Templates. If you would like a tutorial on creating a token-based, audio, OCR, or document-based Project, please watch the corresponding YouTube videos.
The Project Creation Wizard is a tool for creating custom Projects. It has three basic steps: add data, preview data, and labeler tasks. A fourth step, available to team admins, is used to assign labelers.
In the add data step, you upload the files you want to label. Datasaur supports a wide variety of file formats.
You can upload multiple files, but all files in a Project must be the same file format.
You can upload data in two ways: drag and drop the files, or browse for them on your hard drive.
Note: the maximum file size allowed is 50 MB.
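The two upload constraints above (one file format per Project, and a 50 MB size limit) can be checked locally before you start the wizard. The following is a minimal sketch, not part of Datasaur: the function name `check_upload_batch` is hypothetical, it assumes the file format can be inferred from the file extension, and it assumes "50 MB" means 50 × 1024 × 1024 bytes.

```python
from pathlib import Path

# Assumption: "50 MB" is interpreted as 50 * 1024 * 1024 bytes.
MAX_FILE_SIZE_BYTES = 50 * 1024 * 1024


def check_upload_batch(paths, max_bytes=MAX_FILE_SIZE_BYTES):
    """Return a list of problems found in a batch of files.

    An empty list means every file has the same extension and none
    exceeds the size limit. This is a hypothetical local helper; it
    does not call any Datasaur API.
    """
    problems = []
    files = [Path(p) for p in paths]

    # Assumption: the file extension is a good proxy for the file format.
    extensions = {f.suffix.lower() for f in files}
    if len(extensions) > 1:
        problems.append("mixed file formats: " + ", ".join(sorted(extensions)))

    # Flag every file that is larger than the limit.
    for f in files:
        size = f.stat().st_size
        if size > max_bytes:
            problems.append(
                f"{f.name} is {size} bytes, over the {max_bytes}-byte limit"
            )
    return problems
```

For example, `check_upload_batch(["reviews_1.csv", "reviews_2.csv"])` returns an empty list when both files exist, share the `.csv` extension, and are within the limit.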
The preview data step shows you what your data will look like.
The Number of rows displayed per page setting determines how many rows are displayed on one page. Choosing All rows enables infinite scrolling through all the data.
The Expand Media setting allows you to choose the media resolution that will be displayed on the page. Higher resolutions will allow you to view the media in greater detail, but will take longer to load.
The Enable markdown parsing setting allows you to parse Markdown in row-based Projects. We recommend preprocessing your file so that it already contains Markdown syntax before uploading it to Datasaur.
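As an illustration of what such preprocessing might look like, the sketch below rewrites bare URLs in one column of a CSV file as Markdown links. It is only an example under stated assumptions: the function names `linkify` and `add_markdown_links` and the column name are hypothetical, and the article does not specify which Markdown features Datasaur renders, so check the result in the preview step.

```python
import csv
import io
import re

# Simple pattern for bare http(s) URLs. It stops at whitespace, so trailing
# punctuation attached to a URL is treated as part of the URL.
URL_PATTERN = re.compile(r"https?://\S+")


def linkify(text):
    """Replace each bare URL in text with a Markdown link to itself."""
    return URL_PATTERN.sub(lambda m: f"[{m.group(0)}]({m.group(0)})", text)


def add_markdown_links(csv_text, column):
    """Return csv_text with bare URLs in the given column turned into links.

    Hypothetical helper: it reads and writes CSV text in memory and leaves
    all other columns unchanged.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        row[column] = linkify(row[column])
        writer.writerow(row)
    return out.getvalue()
```

Calling `add_markdown_links(text, "text")` on the contents of a CSV file with a `text` column returns the same table with every bare URL in that column wrapped as `[url](url)`.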
You can also edit the header and hide columns by right-clicking the header.
- If you choose Hide column, the column is also hidden for the labelers, but they can show it again later if they want.
- If you choose Hide column from labeler, the labelers won't be able to show the column at all.
In the last step, labeler tasks, you choose the task type, which determines whether labelers will label individual tokens or answer questions about the data.
You can choose from three task types: token-based, row-based, and document-based.
- Token-based allows you to label tokens in a text document.
- If you would like a tutorial on token-based Projects, please watch the corresponding YouTube videos.
- Row-based allows you to label data in tables on a row-by-row basis.
- Document-based allows you to answer a set of questions about each document.
If you select token-based, you will be asked to create or upload a label set. Label sets contain the label classes labelers will be able to choose from.
If you select row-based or document-based, you will be asked to define the set of questions that labelers will answer. There are nine question types available.
After adding the questions and creating your Project, your labeling task will look something like the example below. There are also shortcuts for creating specific Project types, the Project Templates, which the next article discusses in detail.
Finally, if you are already in a Project, you can also create new Projects from the File menu. Once you click one of the formats below, you will be directed straight to the new blank Project.