remote
AI Engineer Gen AI - DICETEK LLC
AI Engineer
Design and deploy generative AI solutions with large language models, autonomous agents, and RAG architectures, optimizing performance and integrating with enterprise systems on cloud platforms.
About the role
Key Responsibilities
- Design, develop, and fine‑tune Large Language Model (LLM) based applications and generative AI products.
- Build autonomous and multi‑agent systems using agentic frameworks and workflow engines.
- Implement Retrieval‑Augmented Generation pipelines to enhance knowledge‑grounded responses.
- Integrate AI solutions with existing enterprise services, APIs, and data stores.
- Deploy, monitor, and scale models on cloud platforms such as AWS or Azure, ensuring reliability and cost efficiency.
Requirements
- Strong proficiency in Python and experience with deep‑learning libraries like PyTorch or TensorFlow.
- Hands‑on experience developing, fine‑tuning, and serving LLMs (e.g., GPT, LLaMA) and using frameworks such as LangChain.
- Knowledge of agentic AI concepts, RAG architectures, and prompt engineering.
- Proven ability to deploy AI workloads on cloud infrastructure (AWS, Azure) and manage containerized environments.
- Solid understanding of software engineering best practices, version control, and CI/CD pipelines.
Skills
pythonpytorchtensorflowlangchainaws