TABILAB repository About and Policies
- Mission Statement
- Terms of Service
- About Repository
- About TABILAB
- License Agreement and Contracts
- Intellectual Property Rights
- Metadata Policy
- Preservation Policy
- Citing Data Policy
The Objective of TABILAB Language Resources service is to provide open and free tools and resources for linguistics and natural language processing generally in Turkish.
Terms of Service
To achieve our mission statement, we set out some ground rules through the Terms of Service. By accessing or using any kind of data or services provided by the Repository, you agree to abide by the Terms contained in the above mentioned document.
Data in TABILAB repository are made available under the licence attached to the resources. In case there is no licence, data is made freely available for access, printing and download for the purposes of non-commercial research or private study. Users must acknowledge in any publication, the Deposited Work using a persistent identifier (see Citing Data), its original author(s)/creator(s), and any publisher where applicable. Full items must not be harvested by robots except transiently for full-text indexing or citation analysis. Full items must not be sold commercially unless explicitly granted by the attached licence without formal permission of the copyright holders.
It is like a library for linguistic data and tools.
- Search for data and tools and easily download them.
- Deposit the data and be sure it is safely stored, everyone can find it, use it, and correctly cite it (giving you credit)
TABILAB (Text and Bioinformatics Lab) is a lab that works on Natural Language Processing and Bioinformatics at the Bogazici University, Istanbul, Turkey.
License Agreement and Contracts
At the moment, TABILAB distinguishes three types of contracts.
- For every deposit, we enter into a standard contract with the submitter, the so-called "Distribution License Agreement", in which we describe our rights and duties and the submitter acknowledges that they have the right to submit the data and gives us (the repository centre) right to distribute the data on their behalf.
- Everyone who downloads data is bound by the licence assigned to the item - in order to download protected data, one has to be authenticated and needs to electronically sign the licence. A list of available licenses in our repository can be found here.
- For submitters, there is a possibility for setting custom licences to items during the submission workflow.
Intellectual Property Rights
As mentioned in the section License Agreement and Contracts, we require the depositor of data or tools to sign a Distribution License Agreement, which specifies that they have the right to submit the data and gives us (the repository centre) right to distribute the data on their behalf. This means that depositors are solely responsible for taking care of IPR issues before publishing data or tools by submitting them to us.
Should anyone have a suspicion that any of the datasets or tools in our repository violate Intellectual Property Rights, they should contact us immediately at our help desk.
Deposited content must be accompanied by sufficient metadata describing its content, provenance and formats in order to support its preservation and dissemination. Metadata are freely accessible and are distributed in the public domain (under CC0). However, we reserve the right to be informed about commercial usage of metadata from TABILAB repository including a description of your use case at Help Desk.
TABILAB is committed to the long-term care of items deposited in the repository, to preserve the research and to help in keeping research replicable and strives to adopt the current best practice in digital preservation.
To fulfill the commitments, the repository ensures that datasets are ingested and distributed in accordance with their license (see agreements and contracts). Sometimes (for licenses that do not permit public access) this means only authorized users can access the dataset.
The submission workflow as described in deposit and the work of our editors ensures discoverability (by requiring accurate metadata) via our search engine, externally through OAI-PMH and in page metadata for certain web crawlers. Metadata are freely accessible.
There are various automated procedures including fixity checks, to ensure integrity of the submitted datasets and completeness of metadata. On the system level we employ various on-site and off-site backup strategies and hardware monitoring. The datasets are accessible online.
We view data and tools as primary research outputs, each submission receives a Persistent IDentifier for reference and the users are guided to use them. Changes in a dataset after it has been published are not permitted, new submission is required instead. The old and new submissions are linked through their metadata (see new version guide for more details).
The various export options offered by the repository system (DSpace) ensures that data and their metadata are not locked in and can be moved to a different repository system.
The repository encourages the usage of specific file formats as recommended by CLARIN. The preferred file formats will change over time, in which case the repository will make every effort to migrate to other formats, while keeping originals intact for reproducibility purposes (ie. migrated item will be a new repository record linked to the old). The guiding principles for format selection are: open standards are preferred over proprietary standards, formats should be well-documented, verifiable and proven, text-based formats are preferred over binary formats where possible, in the case of digitalization of analogue signal lossless or no compression is recommended.