PitchMeAI
EQL Global

Dataspecialist

EQL Global · Berlin, Germany

  • On site
  • Full-time
  • $120,000 / year
  • Berlin, Germany

Job highlights

  • Lead design and operation of market data acquisition stack.
  • Build high-throughput pipelines and parsing infrastructure.
  • Solve hard problems with anti-bot defenses and parsing.
  • Requires 4+ years in production data acquisition.
  • Work on a compliance-first AI platform.

About the role

About EQL Global

EQL Global is the compliance-first AI workflow platform for institutional equity research in European capital markets. Our coverage spans 33,000+ publicly listed companies across 89 countries, surfaced to equity analysts and portfolio managers at Nordic and European financial institutions through AI agents and structured APIs. The data acquisition layer underneath that coverage is the spine of the product. We're hiring the engineer who will own it.

The Role

You will lead the design and operation of EQL's market data acquisition stack — the systems that continuously source, parse, and normalize financial information at scale. You'll work directly with the CPO and technical staff. The work is high-leverage and visible. This is not a generic data engineering role. We need someone who has built acquisition systems in production, has informed opinions about Playwright vs. Scrapy vs. Crawlee, and knows what it takes to keep multi-jurisdiction pipelines running when upstream formats change overnight.

What you will build

  • High-throughput acquisition pipelines spanning thousands of issuers and tens of jurisdictions
  • Parsing infrastructure for filings and reports — PDF (text and scanned / OCR), iXBRL / ESEF, HTML, multilingual content
  • Schema-resilient extractors that detect and recover from upstream changes without silent data loss
  • Scheduling, queueing, and retry systems that keep latency-sensitive feeds flowing in near real time
  • Quality gates — validation, reconciliation, deduplication, anomaly detection — that protect downstream products
  • APIs and stream interfaces that expose this data cleanly to internal AI agents and external clients

The hard problems you will own

  • Modern anti-bot defenses and the cost / throughput economics of working through them lawfully
  • JavaScript-heavy pages and dynamic content
  • Multilingual document parsing across many jurisdictions, each with its own conventions
  • OCR pipelines for scanned filings still common in several markets
  • Schema drift and silent-breakage detection across thousands of distinct formats
  • Cost / throughput tradeoffs across headless browser fleets, HTTP clients, and direct integrations
  • Sound judgment about when to acquire openly, when to license, and when to integrate via official feeds

You will fit if you have

  • 4+ years building production data acquisition systems, ideally in financial, legal, or other structured-document domains
  • Deep Python — async/await, aiohttp / httpx, asyncio patterns at scale
  • Hands-on experience with Playwright, Puppeteer, Scrapy, Crawlee, or equivalent
  • Strong document parsing skills — PDF (pdfplumber, PyMuPDF), HTML (lxml, parsel), iXBRL, OCR
  • Comfort with queueing / orchestration (Celery, Temporal, Airflow, or similar) and Postgres at scale
  • A track record of keeping pipelines alive — monitoring, alerting, drift detection, recovery
  • Sound judgment about acquisition strategy and source legitimacy

Nice to have

  • Experience with financial filings and disclosure regimes (annual reports, prospectuses, transparency disclosures)
  • Background in equity research, capital markets, or fintech
  • Familiarity with iXBRL / ESEF reporting standards
  • Swedish or another Nordic language

How we work

EQL is a compliance-first platform. Our acquisition practices are built around lawful data use — we work within source terms, document our basis for collection, and maintain audit trails. If you've felt uncomfortable with how some shops cut corners, you will find this a clean place to do the work.

Key skills/competency

  • Data Acquisition
  • Python
  • Playwright
  • Scrapy
  • Crawlee
  • Document Parsing
  • OCR
  • iXBRL
  • Queueing Systems
  • Postgres

Skills & topics

  • Data Acquisition Engineer
  • Python
  • Data Engineering
  • Financial Markets
  • AI
  • Compliance
  • Scraping
  • Document Parsing
  • OCR
  • iXBRL

How to get hired

  • Tailor your CV: Highlight your experience with production data acquisition systems, Python, and specific tools like Playwright or Scrapy.
  • Craft a compelling note: In your application, detail an acquisition system, parser, or pipeline you're proud of, providing links or descriptions.
  • Demonstrate your expertise: Showcase your deep Python skills, experience with async/await, and familiarity with queueing systems like Celery or Temporal.
  • Emphasize problem-solving: Mention your track record of keeping pipelines alive and your sound judgment on acquisition strategy.
  • Understand compliance: Convey your commitment to lawful data use and documentation, aligning with EQL Global's principles.

Technical preparation

Master Python async/await and asyncio patterns.,Practice with Playwright, Scrapy, or Crawlee.,Build PDF, HTML, and iXBRL parsers.,Implement queueing and monitoring systems.

Behavioral questions

Describe a complex data acquisition system you built.,How do you handle changing data formats overnight?,Explain your approach to anti-bot defenses.,How do you ensure data quality and prevent loss?

Frequently asked questions

What is the primary focus of the Senior Data Acquisition Engineer role at EQL Global?
The Senior Data Acquisition Engineer at EQL Global will lead the design and operation of the market data acquisition stack, focusing on sourcing, parsing, and normalizing financial information at scale for European capital markets.
What technical skills are essential for this position at EQL Global?
Essential technical skills include 4+ years of experience building production data acquisition systems, deep Python expertise (async/await, asyncio patterns), hands-on experience with tools like Playwright or Scrapy, strong document parsing capabilities (PDF, HTML, iXBRL, OCR), and familiarity with queueing/orchestration systems and Postgres.
How does EQL Global ensure compliance in its data acquisition practices?
EQL Global is a compliance-first platform. Data acquisition practices are built around lawful data use, adhering to source terms, documenting the basis for collection, and maintaining audit trails.
What kind of challenging problems will a Senior Data Acquisition Engineer tackle at EQL Global?
Challenges include overcoming modern anti-bot defenses, handling JavaScript-heavy dynamic content, multilingual document parsing across jurisdictions, OCR for scanned filings, detecting schema drift, and managing cost/throughput tradeoffs for data acquisition.
What is the application process for the Senior Data Acquisition Engineer role at EQL Global?
To apply, send a CV and a short note, ideally highlighting a data acquisition system you are proud of, to sahng.ibrahim@eqlglobal.com. Applications are reviewed on a rolling basis.
Is this a remote or hybrid position at EQL Global?
The Senior Data Acquisition Engineer role is offered as remote within the EU timezone or hybrid in Stockholm/Göteborg.
What is the expected experience level for this role at EQL Global?
The role requires 4+ years of experience building production data acquisition systems, ideally within financial, legal, or structured-document domains.
What makes this data engineering role unique at EQL Global?
This is not a generic data engineering role; EQL Global seeks an engineer with proven experience in building production acquisition systems, informed opinions on scraping tools, and the ability to maintain multi-jurisdiction pipelines through changing formats.