AI Agent Intern (Semantic Search)

NIO
San Jose, United States
Internship, IT/Technology, English

JOB DESCRIPTION

About NIO

NIO is a pioneer and a leading company in the premium smart electric vehicle market. Founded in November 2014, NIO's mission is to shape a joyful lifestyle. NIO aims to build a community starting with smart electric vehicles to share joy and grow together with users.

NIO designs, develops, jointly manufactures and sells premium smart electric vehicles, driving innovations in next-generation technologies in autonomous driving, digital technologies, electric powertrains and batteries. NIO differentiates itself through its continuous technological breakthroughs and innovations, such as its industry-leading battery swapping technologies, Battery as a Service, or BaaS, as well as its proprietary autonomous driving technologies and Autonomous Driving as a Service, or ADaaS.

NIO's product portfolio consists of the ES8, a six-seater smart electric flagship SUV, the ES7 (or the EL7), a mid-large five-seater smart electric SUV, the ES6, a five-seater all-round smart electric SUV, the EC7, a five-seater smart electric flagship coupe SUV, the EC6, a five-seater smart electric coupe SUV, the ET7, a smart electric flagship sedan, and the ET5, a mid-size smart electric sedan.

About the position

Join our AI Platform team to build intelligent agents that unlock the full value of our company's internal knowledge and data. You will work at the intersection of large language models, enterprise data systems, and agentic workflow design - creating tools that allow employees and systems to query, interpret, and act on everything from unstructured documents to live operational databases.
This internship is hands-on from day one. You will own a real workstream, collaborate with senior engineers and data architects, and ship features that are used in production before the summer ends.

Project scope

The core challenge: our company generates enormous volumes of internal data - design specifications, operational runbooks, relational databases, event streams, and data catalogs - yet most of it remains siloed and hard to access programmatically.
Your work will focus on building an AI agent layer that bridges natural language with these data sources. Specifically, you will tackle three interconnected areas:
* Document intelligence - indexing and retrieving relevant content from design and operations documents using semantic search and RAG pipelines.
* Structured data querying - enabling the agent to generate, validate, and execute SQL or API queries against both static (data warehouse) and real-time (vehicle) structured datasets.
* Metadata-aware reasoning - integrating data catalog metadata so the agent understands schema context, data freshness, ownership, and access policies before surfacing results.
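As a rough illustration of the "generate, validate, and execute" step in the structured data querying area, here is a minimal Python sketch. It is not NIO's implementation: the `vehicles` table, the helper names `validate_sql` and `run_with_retries`, and the use of an in-memory SQLite database are all assumptions made for the example. The list of candidate queries stands in for successive LLM attempts, where a real agent would feed each error message back to the model.

```python
import sqlite3


def validate_sql(conn, sql):
    """Return None if sql is a single SELECT that compiles, else an error string.

    Deliberately conservative (hypothetical policy): any extra semicolon is
    rejected, even inside a string literal.
    """
    stripped = sql.strip().rstrip(";").strip()
    if ";" in stripped:
        return "multiple statements are not allowed"
    if not stripped.lower().startswith("select"):
        return "only SELECT queries are allowed"
    try:
        # Compiles the statement against the live schema without running it.
        conn.execute("EXPLAIN QUERY PLAN " + stripped)
    except sqlite3.Error as exc:
        return str(exc)
    return None


def run_with_retries(conn, candidates):
    """Run the first candidate that validates; collect errors from the rest."""
    errors = []
    for sql in candidates:
        err = validate_sql(conn, sql)
        if err is None:
            return conn.execute(sql).fetchall(), errors
        errors.append(err)  # a real agent would pass this back to the LLM
    return None, errors


# Hypothetical schema and data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE vehicles (vin TEXT, model TEXT)")
conn.executemany(
    "INSERT INTO vehicles VALUES (?, ?)", [("V1", "ET5"), ("V2", "ES8")]
)

candidates = [
    "SELECT modle FROM vehicles",               # typo: fails validation
    "SELECT model FROM vehicles ORDER BY vin",  # corrected attempt
]
rows, errors = run_with_retries(conn, candidates)
print(rows, errors)
```

Validating with `EXPLAIN QUERY PLAN` catches schema errors such as a misspelled column before anything runs, which is one simple way to build the error-handling loop. A production system would likely add a read-only connection and row limits as well.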

What you will learn

* How to architect multi-step AI agents with tool-use, memory, and planning using frameworks such as LangChain, LlamaIndex, or custom orchestration layers.
* Practical techniques for reasoning-based retrieval-augmented generation (RAG) over heterogeneous document collections.
* Text-to-SQL generation, query validation, and safe execution against live databases with schema introspection and error-handling loops.
* How to work with data catalogs and metadata stores to ground agent reasoning in authoritative schema information and governance constraints.
* Evaluation methodology for agentic systems: how to measure accuracy, latency, tool-call efficiency, and hallucination rates in enterprise settings.
* Collaborative engineering practices: code review, technical writing, and cross-functional communication with data, product, and operations stakeholders.
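The evaluation item in the list above names four metrics: accuracy, latency, tool-call efficiency, and hallucination rate. As a hedged sketch of how they could be aggregated over a benchmark run, the Python below uses a hypothetical `RunRecord` structure. The field names, the exact-match accuracy rule, and the `grounded` label (assumed to come from a human or automated grader) are illustrative assumptions, not a prescribed methodology.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class RunRecord:
    expected: str     # reference answer for the benchmark question
    answer: str       # what the agent returned
    latency_s: float  # wall-clock time for the full agent run
    tool_calls: int   # number of tool invocations the agent made
    grounded: bool    # assumed grader label: every claim traced to a source


def summarize(records):
    """Aggregate per-run records into the four headline metrics."""
    n = len(records)
    return {
        "accuracy": sum(r.answer == r.expected for r in records) / n,
        "mean_latency_s": mean(r.latency_s for r in records),
        "mean_tool_calls": mean(r.tool_calls for r in records),
        "hallucination_rate": sum(not r.grounded for r in records) / n,
    }


# Made-up records, for illustration only.
records = [
    RunRecord("42", "42", 1.0, 1, True),
    RunRecord("ES8", "ES8", 2.0, 2, True),
    RunRecord("7", "9", 3.0, 3, False),
    RunRecord("ok", "ok", 2.0, 2, True),
]
summary = summarize(records)
print(summary)
```

Exact string match is the simplest possible accuracy rule. Free-text answers would usually need a more lenient comparison, such as normalized matching or a graded rubric.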

Deliverables

* Document retrieval agent. A working RAG pipeline over internal design and ops documents with a conversational query interface.
* Data query module. A text-to-SQL agent component capable of querying both static warehouse tables and real-time data sources.
* Metadata integration layer. A connector that links the agent to the data catalog so it can reason about schema, lineage, and freshness.
* Evaluation report. A benchmark suite and written analysis covering accuracy, latency, and failure modes of the agent system.
* End-of-internship demo. A live presentation to engineering and data leadership showcasing the system and key findings.

Qualifications

* Currently enrolled in a graduate program in Computer Science, Data Science, Software Engineering, or a closely related technical field.
* Solid Python programming skills, including familiarity with standard data and ML libraries (pandas, NumPy, PyTorch or similar).
* Working knowledge of SQL - ability to write, read, and debug queries against relational databases.
* Familiarity with large language model APIs (OpenAI, Anthropic, or equivalent) and at least one hands-on project using them.
* Understanding of basic information retrieval concepts: similarity search, relevance search, and the fundamentals of RAG.
* Ability to read and understand REST API documentation and integrate third-party services in code.
* Strong written and verbal communication skills, with the ability to document technical work clearly for both technical and non-technical audiences.

Preferred qualifications

* Currently pursuing a PhD.
* Experience with agentic AI frameworks such as LangChain, LlamaIndex, AutoGen, or comparable libraries (strong plus).
* Familiarity with data catalog or metadata management tools (DataHub, Amundsen, Apache Atlas, dbt docs).
* Exposure to real-time or streaming data systems such as Apache Kafka, Spark Structured Streaming, or Flink.
* Experience with cloud data platforms - Redshift, BigQuery, or Databricks.
* Knowledge of enterprise data governance concepts: data lineage, access control, PII classification, and schema versioning.
* Prior internship or research experience in an AI, data engineering, or ML platform role (strong plus).

Compensation:
The US base salary range for this full-time position is $38.00 - $46.00.
* Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.
* Please note that the compensation details listed in US role postings reflect the base salary only. They do not include discretionary bonuses, equity, or benefits.

Source: Company website
Published: 15 Apr 2026
Employment type: Internship
Duration: 4 months
Compensation: 46 USD
Languages: English