Job Overview
Job TitleDocument Sourcing Specialist
Job TypeContractor
Offered Salary$60,000
LocationRemote
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
Document Sourcing Specialist
Join our customer's team as a Document Sourcing Specialist, where your keen eye for detail and passion for compliance will directly impact the quality of data used in AI training. In this fully remote role, you will identify, verify, and source open-access documents from a variety of reputable repositories to ensure they meet stringent licensing requirements.
Key Responsibilities
- Source publicly available documents from platforms such as government archives, academic repositories, open datasets, and licensed open-source documentation.
- Verify and document the license type of every sourced document, ensuring strict adherence to requirements such as CC0, CC-BY, MIT, or Apache 2.0 (or equivalent).
- Log critical metadata for each submission, including source URLs and full license details, in designated tracking tools.
- Flag and annotate any issues related to ownership, unclear licensing, paywalled access, or content with non-commercial usage restrictions.
- Collaborate with data engineering and compliance teams to clarify requirements and resolve sourcing ambiguities.
- Maintain up-to-date knowledge of open data best practices, licensing changes, and repository navigation strategies.
- Communicate findings and unresolved issues clearly in both written and verbal form, supporting documentation integrity and compliance audits.
Required Skills and Qualifications
- Exceptional attention to detail and ability to accurately review complex licensing and compliance information.
- Experience sourcing documents from repositories such as SEC EDGAR, arXiv, Kaggle, and GitHub.
- Proficiency in academic research, data collection, and public records searching.
- Strong written and verbal communication skills, able to articulate findings and collaborate remotely.
- Demonstrated ability to distinguish between open and restricted content, and to identify potential sourcing risks.
- Comfort working independently in a fast-paced, remote environment with evolving priorities.
- Highly organized, reliable, and adept at managing and documenting large volumes of information.
Preferred Qualifications
- Prior experience supporting AI or machine learning projects with high-quality data sourcing.
- Familiarity with open-source licensing and data compliance regulations.
- Background in academic research, information science, or legal review.
Key skills/competency
- Document Sourcing Specialist
- Data Sourcing
- Compliance
- Licensing
- AI Training Data
- Metadata
- Open Access
- Data Integrity
- Remote Work
- Attention to Detail
How to Get Hired at Micro1
- Tailor your resume: Highlight experience in document sourcing, licensing, and compliance.
- Showcase attention to detail: Provide examples of meticulous data verification and metadata logging.
- Emphasize remote work skills: Demonstrate strong communication and independent work ethic.
- Research compliance: Understand open-source licenses and data regulations relevant to AI.
- Prepare for collaboration: Be ready to discuss cross-functional teamwork with data and compliance teams.
Frequently Asked Questions
Find answers to common questions about this job opportunity
01What are the primary responsibilities of a Document Sourcing Specialist at micro1?
02Is this Document Sourcing Specialist role remote?
03What kind of documents will I be sourcing for this role?
04What are the essential qualifications for the Document Sourcing Specialist position?
05Does micro1 prefer candidates with AI or machine learning project experience for this role?
06What are common open-source licenses I should be familiar with for this role?
07How important is attention to detail for a Document Sourcing Specialist?
08What tools might a Document Sourcing Specialist use at micro1?
Explore similar opportunities that match your background