Our client Ivector is looking for a Senior AI Engineer in Islamabad
Join ivector as a Senior AI Engineer to lead the design and development of advanced artificial intelligence systems that operate efficiently at scale.
You are perfect candidate if you have shipped production AI systems end to end — distributed services, trained models, and retrieval pipelines. You own architecture decisions, you measure what you ship, and you reason about cost, failure, and quality before writing code. This is a broad, high-ownership role spanning automation infrastructure, applied ML, and knowledge/RAG systems. We are not looking for someone who has done all of it perfectly; we are looking for someone who has done enough of it in production to learn the rest fast.
About Us
We're building the AI system that automates the broker's real work end to end: logging into hostile third-party portals, reading messy regulated documents, scoring risk, and making recommendations, reliably enough to trust in a domain where mistakes have consequences. The hard problems here are the good kind: anti-bot evasion at scale, applied ML on real outcome data, and RAG that has to cite the right clause from the right document version, every time.
We're a startup, which means you'll own architecture, not tickets. A few others are chasing this space, we intend to outbuild them by shipping fast and shipping things that actually work. Leading that charge is our CTO, Zain Ali, who recently left Google to build this, after engineering at Netflix and Microsoft. You'll work directly under that bar.
If you want to build the thing that defines a category, not maintain something that already won, this is the room to be in.
Responsibilities
- The Work
Distributed Automation
- Rearchitect a single-machine automation engine into a system handling 1,000+ concurrent jobs against third-party portals: microservices, persistent browser session pools, container orchestration
- Design the job queue and execution model: dispatch, retry, per-target rate limiting, circuit breakers, real-time status feed to the user-facing UI
- Build an onboarding engine that, given a portal URL and credentials, discovers fields, maps them to our internal schema, and generates a working automation script with no manual scripting
ML & Model Quality
- Replace heuristic scoring with supervised models trained on historical outcome data; design and train domain-specific risk and recommendation models
- Benchmark against LLM-only baselines and pick the right tool per problem
- Stand up eval harnesses on every model, runnable on demand and on a schedule, that catch regressions before production
- Own the R&D track for self-hosted inference (LLMs, embeddings, ASR, TTS) and deliver cost-vs-quality memos that drive build/buy decisions
Hard Requirements
- Python (FastAPI, async/await): production service design, not scripts
- Distributed job execution with Celery + Redis or equivalent, Postgres as job store; you have debugged retry storms and backpressure in production
- Kubernetes or Docker Swarm: deployed and operated, not tutorial-level
- Browser automation at scale: Playwright or Puppeteer handling MFA flows, session persistence, and anti-bot systems (isTrusted events, PerimeterX-class protection)
- Python ML stack: PyTorch or equivalent, scikit-learn, HuggingFace; you build training pipelines, not just call APIs
- Supervised learning and embeddings: labels, class imbalance, and clean feature pipelines from messy source data
- Eval-first mindset: every model and knowledge system ships with a benchmark for correctness, freshness, and attribution
- You reason about cost-per-job, failure blast radius, and queue depth before writing code
Strong Plus
- LLM integration for DOM and field inference (vision models)
- LLM fine-tuning / RLHF
- SageMaker or equivalent for model serving; inference cost benchmarking across providers (Anthropic, OpenAI, self-hosted)
- Regulated-domain experience (insurance, fintech, health, legal) — knowing what you are reading matters
- Web scraping and change-detection infrastructure (PDF diffing, DOM monitoring)
- DigitalOcean infrastructure experience
