Data Engineer
reczee
Job Description
Key Responsibilities
- Design and maintain scalable ETL/ELT pipelines
- Build and manage data ingestion workflows from multiple data sources
- Transform raw data into clean, usable datasets for analytics and reporting
- Develop and maintain data models, tables, and warehouse structures
- Optimize database performance, query efficiency, and pipeline reliability
- Monitor data jobs, troubleshoot failures, and ensure data quality
- Work with batch and near real-time data processing systems
- Collaborate with cross-functional teams to gather data requirements
- Support BI dashboards, reporting systems, and downstream analytics use cases
- Maintain documentation for pipelines, schemas, and data workflows
- Follow best practices for data governance, security, and scalability
Required Skills & Qualifications
- Bachelor’s degree in Computer Science, Engineering, Information Technology, or a related field
- 3–4 years of experience in data engineering or a related field
- Strong proficiency in SQL
- Hands-on experience with Python or Java
- Experience building and maintaining ETL/ELT pipelines
- Good understanding of data warehousing concepts and data modeling
- Experience with tools/frameworks such as Kafka, Airflow, Spark, or similar
- Familiarity with cloud platforms such as AWS or GCP
- Experience with data warehouses such as ClickHouse, Snowflake, BigQuery, Databricks, or similar
- Strong understanding of relational and non-relational databases
- Experience in performance tuning and troubleshooting data workflows
- Good communication and collaboration skills
Good to Have
- Experience with real-time/streaming pipelines
- Familiarity with CI/CD and DevOps practices
- Exposure to containerization tools like Docker
- Understanding of data quality and observability tools
- Experience working in product-based or fast-paced startup environments
What We’re Looking For
- Strong problem-solving ability
- Ownership mindset and attention to detail
- Ability to work independently and collaboratively
- Passion for building reliable and scalable data systems