Business Services & Consulting • all cities, SD 42
AI Engineer (42)
all cities, SD 42On-sitePosted 1 day ago
Business Services & Consulting
About the Role
AI Engineer
We are looking for a skilled AI Engineer with a strong background in FastAPI framework and experience in handling background tasks. The ideal candidate will be responsible for designing, implementing, and maintaining robust, scalable APIs for our product/service. You will collaborate closely with cross-functional teams to deliver high-quality solutions that meet our clients' needs.
Key Responsibilities:
Develop and maintain APIs using the FastAPI framework.
Implement background tasks and asynchronous processing to optimize system performance.
Designing and building LLM-powered features and pipelines (RAG, agents, tool use, multi-step reasoning chains)
Context engineering — structuring prompts, managing context windows, optimizing token usage, implementing retrieval strategies to feed LLMs the right information at the right time
Working with LLM APIs (Claude, OpenAI, etc.) and orchestration frameworks
Building evaluation and testing frameworks for LLM outputs (accuracy, hallucination detection, regression testing)
Implementing guardrails, content filtering, and responsible AI patterns
Optimizing latency and cost for LLM-heavy workloads (caching, streaming, batching)
Write clean, efficient, and maintainable code that adheres to industry best practices.
AI Engineer
We are looking for a skilled AI Engineer with a strong background in FastAPI framework and experience in handling background tasks. The ideal candidate will be responsible for designing, implementing, and maintaining robust, scalable APIs for our product/service. You will collaborate closely with cross-functional teams to deliver high-quality solutions that meet our clients' needs.
Key Responsibilities:
Develop and maintain APIs using the FastAPI framework.
Implement background tasks and asynchronous processing to optimize system performance.
Designing and building LLM-powered features and pipelines (RAG, agents, tool use, multi-step reasoning chains)
Context engineering — structuring prompts, managing context windows, optimizing token usage, implementing retrieval strategies to feed LLMs the right information at the right time
Working with LLM APIs (Claude, OpenAI, etc.) and orchestration frameworks
Building evaluation and testing frameworks for LLM outputs (accuracy, hallucination detection, regression testing)
Implementing guardrails, content filtering, and responsible AI patterns
Optimizing latency and cost for LLM-heavy workloads (caching, streaming, batching)
Write clean, efficient, and maintainable code that adheres to industry best practices.
What You'll Do
Develop and maintain APIs using the FastAPI framework.
Implement background tasks and asynchronous processing to optimize system performance.
Designing and building LLM-powered features and pipelines (RAG, agents, tool use, multi-step reasoning chains)
Context engineering — structuring prompts, managing context windows, optimizing token usage, implementing retrieval strategies to feed LLMs the right information at the right time
Working with LLM APIs (Claude, OpenAI, etc.) and orchestration frameworks
Building evaluation and testing frameworks for LLM outputs (accuracy, hallucination detection, regression testing)