logo inner

Linguistic Data Engineer Intern

Babel StreetSomerville | Massachusetts, United StatesOnsite
This job is no longer open

Babel Street is the trusted technology partner for the world’s most advanced identity intelligence and risk operations. We deliver advanced AI and data analytics solutions
providing unmatched, analysis-ready data regardless of language, proactive risk identification, 360-degree insights, high-speed automation, and seamless integration into existing systems. Babel Street empowers government and commercial organizations to transform high-stakes identity and risk operations into a strategic advantage.The actionable insights we deliver safeguard lives and protect critical assets around the world.  Babel Street is headquartered in Reston, Virginia, with regional offices in Boston, MA and Cleveland, OH, and international offices in Australia, Canada, Israel, Japan, and the U.K.

For more information, visit www.babelstreet.com.

About the Job


Babel Street is seeking a Linguistic Data Engineer Intern to be a part of a growing data team in support of several text analytics products.  This person will have the opportunity to work with multiple, discrete engineering teams providing clean, reliable data to train, develop, and evaluate natural language processing systems as well as consult on the language specific aspects of multilingual text.

Responsibilities: 


  • Assist with managing large-scale text mining, data acquisition and annotation projects
  • Train and supervise contractors as they perform manual annotation tasks
  • Measure reliability of parallel, manual annotations
  • Describe and demonstrate linguistic phenomena on a variety of natural languages
  • Survey and catalogue new data releases and best practices in data maintenance, conversion, and analytics

Qualifications:



  • Strong scripting abilities, especially Python or R
  • Data cleaning, conversion, organization
  • Parsing XML, JSON, tabular data sets
  • Scraping and collecting text from online resources including web sites and APIs

  • Ability to write and revise annotation guidelines
  • Familiarity with prominent linguistic annotation guidelines (e.g., Penn Treebank)
  • Ability to synthesize clear instructions and instructive examples

  • Knowledge of Linguistics and NLP applications including
  • Language identification
  • Tokenization
  • Part of speech tagging
  • Morphological analysis
  • Entity extraction, disambiguation, and linking
  • Syntactic parsing
  • Sentiment analysis

  • Familiarity with linguistic community resources and data providers such as
  • Universal Dependencies treebank project
  • ClueWeb
  • CommonCrawl
  • Linguistic Data Consortium

  • Experience working with manual annotation tools and platforms such as brat, WebAnno, Prodigy, Mechanical Turk, etc.

  • Nice to have:
  • Experience with databases such as SQL and Mongo
  • Proficiency in at least one natural language in addition to English
  • Experience with conversion, storage, version control and maintenance tasks for large multilingual text collections
  • Fluency in Mandarin Chinese

#LI-DNIBenefits at Babel Street (just to name a few...)

  • Health Benefits: Babel Street covers 90-100% monthly premium costs for Medical, Dental, Vision, Life & Disability insurances – for you and your family!
  • Retirement Plans: Babel Street offers both a Traditional and Roth 401(K) with a very competitive match.
  • Unlimited Flexible Leave: We trust our employees to manage their own time and balance their personal and work lives.
  • Holidays: Babel Street provides employees with 12 paid Federal Holidays
  • Tuition Reimbursement: We are committed to investing in our employees. One way we do that is with our Tuition Reimbursement Program for continuing education.

Babel Street is an equal opportunity/affirmative action employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. Further, Babel Street will not discriminate against applicants for inquiring about, discussing or disclosing their pay or, in certain circumstances, the pay of their co‐worker, Pay Transparency Nondiscrimination.In addition, Babel Street's policy is to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works.

Upon request, we will provide you with more information about such accommodations.

This job is no longer open

Life at Babel Street

Discover what matters to you regardless of platform, language, or location.\nBabel Street enhances your capabilities for public search and makes analysts more efficient.\n\nWith advanced analytics, Babel Street makes sense of large tracts of multi-lingual data in near real-time. Babel Street assigns and charts sentiment for social media in over 20 major world languages. Users identify themes, entities, and categories, as well as detect relationships, within the cloud-based platform. Customers may access Babel Street 24/7/365 from any computer, device, or smartphone with an internet connection and a web browser.\n\nBabel Street offers a variety of products, built and crafted with the customer as our primary inspiration. From determining the best solution to assisting in mission support, Babel Street’s team of experts will ensure success at every point along the way.
Thrive Here & What We Value1. Unlimited Flexible Leave2. Health Benefits (90% monthly premium coverage for Medical, Dental, Vision, Life & Disability insurances)3. Retirement Plans (Traditional and Roth 401(K) with competitive match)4. Tuition Reimbursement Program5. Pay Transparency Nondiscrimination6. Equal Opportunity/Affirmative Action Employer7. Paid Federal Holidays (12 days)8. Holidayer-related policies
Your tracker settings

We use cookies and similar methods to recognize visitors and remember their preferences. We also use them to measure ad campaign effectiveness, target ads and analyze site traffic. To learn more about these methods, including how to disable them, view our Cookie Policy or Privacy Policy.

By tapping `Accept`, you consent to the use of these methods by us and third parties. You can always change your tracker preferences by visiting our Cookie Policy.

logo innerThatStartupJob
Discover the best startup and their job positions, all in one place.
Copyright © 2024