Role: Linguist - Data Specialist
Type: B2B
Location: Warsaw (Hybrid)
Role Overview:
The Linguist - Data Specialist (Ling II) is an early-career linguist focused on data-driven
tasks such as data collection, annotation, and analysis. This role supports the
development and evaluation of language technologies by providing high-quality linguistic
data.
Key Responsibilities:
• Data Collection & Annotation: Gathers and annotates linguistic data for use in
training and evaluating language models and other i18n tools.
• Quality Control: Reviews and validates annotated data to ensure accuracy and
consistency.
• Documentation: Maintains clear records of data sources, annotation guidelines,
and project progress.
• Continuous Learning: Develops expertise in new annotation tools, linguistic
concepts, and data management practices.
Qualifications:
• Bachelor's degree in general or computational linguistics, computer science,
speech science, language studies, or a related field, plus 1-2 years of
industry experience; OR a master's degree with relevant experience (internship,
thesis project, etc.).
• Knowledge of syntax, semantics, pragmatics, sociolinguistics, corpus linguistics,
and other areas of linguistics.
• Experience with data annotation, linguistic analysis, or corpus management.
• Strong attention to detail and organizational skills.
• Familiarity with at least one annotation tool or platform.
• Experience with database queries and data analysis processes (e.g., SQL or
spreadsheets).
• Familiarity with Python.