Named Entity Recognition

Norwegian
Finnish
Hindi
Arabic
Swedish
Russian
Czech
Turkish
Danish
Hebrew

About this dataset

You get access to 24 categories of annotated named entities, ranging from the typical person names, locations, and company names to markers for date, time, and duration - amongst many others. Train models to be able to identify any entity relevant to your chatbot or NLP application!

The dataset features 150,000 sentences in Norwegian (Bokmal), Finnish, Turkish, Hindi, Arabic, Danish, Swedish, Hebrew, Russian, and Czech.

License Information

This dataset is covered by Defined.ai standard Data license agreement. The license agreement is perpetual and allows for the commercialization of all models built on the data.

Sample

NER_RU_Short_Sample.PNG

Download Free Samples

Tell us about yourself, and download a sample of the NER dataset
All fields are required

By clicking on the appropriate button or by downloading, installing, accessing, and/or using the data sample, you are agreeing with Defined.ai Privacy Policy, Terms of Use, and Data License Agreement.

You might also be interested in:

Parallel Corpora

4 billion units, 40 languages
Multilingual
Albanian
Arabic
+19
DAI logo
Defined.ai hosts the leading online marketplace for buying and selling AI data, tools and models, and offers professional services to help deliver success in complex machine learning projects. Defined.ai is a community of AI professionals building fair, accessible and ethical AI of the future.
Datasets
Contact
1201 3rd Avenue, STE 2200, Seattle WA
[email protected]
Wired logo
Forbes 2019 AI50 logo
CB insights logo
Forbes 2020 logo
Inc. 5000 logo
PME logo

© 2023 DefinedCrowd. All rights reserved.