We are hiring an AI Engineer (m/f/d) to holistically develop BIT Capital's LLM and agentic Artifical Intelligence capabilities: from the data science that turns data into investment signals, to the engineering that powers our financial intelligence agents. A strong initial focus is the build-out of “Aion” - our agentic research platform - that has first-class access to our data, models, and tools and is already in mainstream production, used by the investment team every day. Beyond, your mandate covers everything LLM- and agent-related across BIT, from research products for the investment team to internal tools in Sales, Marketing, Operations, and client-facing experiences.
You work autonomously, from first scoping to shipping, and you self-select the use cases and projects where you can have the most impact. You report to our Carlos Bielsa (Chief AI Officer & Managing Partner) and work side by side with Vlad Gheorghe, our second AI Engineer, acting as our internal expert as the frontier of AI moves. Our stack is Python, SQL, AWS, and Databricks, with models and agentic frameworks from the leading AI labs (Anthropic Managed Agents, Codex).
Your priorities in this role will thus include:
Investment signal pipelines. Design and ship LLM-based extraction pipelines (prompts, retrieval, model selection, evaluations) that turn text and data into investment scores, alerts, signals, and insights reliable enough to drive decisions.
Agentic systems behind “Aion”. Build and evolve the agentic systems that power Aion, along with the research skills and Python sidecars that agents execute.
Data and tool integrations. Build the integrations that give agents performant, safe access to BIT's data and capabilities.
Evaluations and model selection. Build automated and human-in-the-loop evaluations, regression tests, and monitoring that track accuracy, hallucination rates, latency, cost, and model uncertainty. Benchmark and select the right model per task, including open-source options.
LLMOps at scale. Establish the LLMOps practices that keep these systems reliable at scale: versioning of prompts, configs, and models; CI/CD; error handling; fallbacks; and cost control.
Frontier into production. Stay at the frontier of LLM and agentic AI techniques and bring new ideas into production rather than leaving them as experiments.
