Careers
ML Training & Evaluation Intern
Support evaluation, experiments, and quality loops across Align, Orange, and future products.
Role overview
You’ll work on data preparation, prompt/retrieval experiments, benchmark runs, and dashboards that help us understand model behavior in real usage. This is practical ML: measurable, iterative, and tied to product outcomes.
Responsibilities
- Run evaluation harnesses and track quality changes over time
- Support prompt, retrieval, and dataset iteration experiments
- Help produce concise reports that translate ML results into product decisions
- Improve reliability of pipelines: logging, consistency checks, and documentation
Requirements
- Comfort with Python fundamentals and working with data
- Curiosity about model behavior and evaluation methodology
- Strong attention to detail (repeatability matters here)
Nice to have
- Experience with vector search / embeddings
- Dashboards or notebook workflows
- Basic LLM tooling familiarity
Internship compensation
Internships can be paid or unpaid depending on scope, location, and legal constraints. In the UK, unpaid arrangements are typically only feasible in limited situations (for example: genuine volunteering with charities/non-profits, short work-shadowing/observation, or certain structured course-related placements). If the role involves productive work that benefits the company and looks like a “worker” role, it generally must be paid at least the National Minimum Wage. We will always structure internships compliantly.
Note: This is general information and not legal advice. Final structure depends on the role and jurisdiction.
Company culture
Upsilon is product-led. We build and operate our own software products. We move fast, but we care deeply about clarity, reliability, and outcomes.
- We prefer measurable improvements over vague claims
- We iterate quickly, but we write things down clearly
- Quality and reproducibility are part of the job
How we work
- Short feedback loops, clear ownership, and high trust
- Documentation that’s minimal, but actually useful
- Execution quality matters — reliability is a feature
Apply
Ready to apply?
Love the role? Amazing. Applications aren’t open yet — but you can prepare your portfolio and check back when the window opens.