
AI / LLM Engineer
Job Description
Posted on: August 11, 2025
We are looking for an AI Engineer with hands-on experience in Large Language Models (LLMs) to integrate intelligent features into our software product. This role focuses on Java backend development and requires expertise in both cloud and offline/on-premises AI solutions. Key Responsibilities
- Integrate LLMs (e.g., GPT-4, Falcon, LLaMA, Mixtral) into Java backend systems.
- Develop local services (in Python or Java) to serve offline models when needed.
- Design and maintain REST/JSON endpoints for communication between Java services and AI modules.
- Personalize and adapt model outputs through prompt engineering.
- Implement logic for natural language understanding, question/answer generation, and response analysis.
- Support hybrid architecture: cloud-first with fallback or dedicated on-premises mode.
- Ensure data privacy, performance, and security in AI integrations.
- Collaborate with backend, frontend (Angular), and product teams for seamless integration.
Required Skills & Experience
- Experience with LLMs (e.g., GPT, Falcon, LLaMA, BloomZ).
- Experience integrating APIs (OpenAI, HuggingFace, Ollama).
- Strong Python and Java skills for backend development (FastAPI, Flask).
- Expertise in Java backend development, especially with Spring Boot.
- Familiarity with AWS services (API Gateway, EC2, Lambda, etc.).
- Experience deploying AI models in on-premises environments.
- Familiar with model quantization and serving tools (HuggingFace, llama.cpp, Ollama).
Nice To Have
- Familiarity with LangChain, vLLM, or Retrieval-Augmented Generation (RAG).
- Experience with multilingual prompt engineering.
- Working knowledge of Angular.
- Experience with AI solutions in offline enterprise environments.
- Knowledge of privacy regulations (e.g., GDPR) and edge computing best practices.
Who You Are
- Solution-oriented, with strong problem-solving skills.
- Comfortable working autonomously and taking technical ownership.
- Eager to collaborate with cross-functional teams.
- Curious and passionate about exploring new AI technologies.
What We Offer
- An innovative product focused on real-world Generative AI.
- Influence in technical decisions and solution architecture.
- Flexible, remote work with autonomy.
- Growth opportunities with modern tools and open-source models.
If you’re excited about making an impact in the AI space, we’d love to hear from you! Apply now and join our dynamic team Skills: model quantization,natural language understanding,huggingface,python,spring boot,rest/json,api integration,artificial intelligence,ollama,large language models,java,large language models (llms),flask,falcon,aws,fastapi
Apply now
Please let the company know that you found this position on our job board. This is a great way to support us, so we can keep posting cool jobs every day!

Remote-Work.app
Get Remote-Work.app on your phone!

Machine Learning Engineer - 100% remoto

Junior Pharmacovigilance Project Manager

Business Development Executive - Growth opp in High Ticket Education

Workday Payroll Implementation Consultant
