Remote Data Engineer
hirecrap
Job Description
- Work with structured and unstructured datasets to support SWE Bench-style evaluation tasks.
- Design, build, and validate data pipelines used in benchmarking and evaluation workflows.
- Perform data processing, analysis, feature preparation, and validation for data science use cases.
- Write, run, and modify Python code to process data and support experiments locally.
- Evaluate data quality, transformations, and outputs for correctness and reproducibility.
- Create clean, well-documented, and reusable data workflows suitable for benchmarking.
- Participate in code reviews to ensure high standards of code quality and maintainability.
- Collaborate with researchers and engineers to design challenging, real-world data engineering and data science tasks for AI systems.