Data Engineer

ABOUT FORIAN INC.

 

Forian provides a unique suite of SaaS solutions, data management capabilities, and proprietary data and analytics to optimize and measure operational, clinical, and financial performance for customers within the traditional and emerging life sciences, healthcare payer and provider segments.

 

DESCRIPTION

At Forian, we’re tackling complex challenges in healthcare, life sciences, and consumer analytics by transforming a massive and ever-expanding data repository – multi-terabytes and growing – into high-quality, insightful data products. We’re not just moving data; we’re building a trusted foundation that empowers our clients and internal teams to make informed decisions with confidence. We’re seeking a passionate and skilled Data Engineer to join our collaborative team and be a key driver in designing, implementing, and continuously elevating the quality, relevance, and transparency of our data products. If you’re excited about building data solutions that are not only scalable but also inherently trustworthy and user-friendly, we encourage you to apply!

 

RESPONSIBILITIES

As a Data Engineer at Forian, you will be at the forefront of building and maintaining our core data and insight products. Your focus will be on ensuring that our data is not only processed efficiently at scale but is also highly relevant and traceable, and that it empowers independent data exploration. Your core mission will be to:

  • Architect and Build Intelligent Data Pipelines: Design and implement robust, scalable data ingestion, processing, and transformation pipelines that prioritize data quality, relevancy, and provenance.
  • Champion Data Quality, Relevancy, and Provenance: Implement rigorous data quality checks, monitoring, and lineage tracking to ensure accuracy, reliability, consistency, and a clear understanding of data origins and transformations.
  • Empower Data Users: Design data models and structures that are intuitive, well-documented, and facilitate efficient self-service data exploration and analysis.
  • Drive Continuous Improvement: Proactively identify opportunities to optimize data pipelines and architectures for performance, scalability, maintainability, and enhanced data quality and transparency.
  • Collaborate Across Teams: Partner closely with product managers, data scientists, analysts, and other engineers to deeply understand data needs and deliver solutions that are both technically sound and highly relevant to their objectives.
  • Leverage Modern Data Technologies: Utilize and explore cutting-edge cloud-based data technologies to build innovative, efficient, and transparent data solutions.
  • Contribute to Data Governance and Documentation: Participate in defining and implementing data governance policies, including data lineage standards, and contribute to comprehensive data documentation.

 

BASIC QUALIFICATIONS


We’re looking for a detail-oriented problem-solver who understands that data quality extends beyond just accuracy to encompass its relevance and traceability. Ideally, you’ll have:

 

  • A minimum of 3 years of professional experience in a Data Engineering role, with a demonstrable understanding of data quality principles.
  • A Bachelor’s degree in Computer Science, Engineering, a quantitative field (e.g., Statistics, Mathematics), or equivalent practical experience.
  • Solid understanding of data warehousing concepts, data modeling principles, and ETL/ELT processes, with a strong emphasis on data quality and governance.
  • Proven ability to design, build, and maintain scalable data pipelines using technologies like Spark, with a focus on incorporating data quality and lineage tracking.
  • Strong proficiency in at least one major programming language relevant to data engineering, including Python at a minimum.
  • Experience working with cloud platforms (e.g., AWS, Azure, GCP) and their data-related services (e.g., data lakes, data warehouses, serverless compute), with an understanding of how to leverage cloud features for data quality and governance.
  • A strong focus on data quality, including experience implementing data validation, monitoring, and data lineage tracking processes.
  • Excellent problem-solving and analytical skills with the ability to trace data issues back to their origin and implement robust solutions.
  • Strong communication and collaboration skills, with the ability to clearly articulate data flows, quality metrics, and data provenance to both technical and non-technical audiences.
  • A proactive and continuous learning mindset, eager to stay up to date with the latest data engineering trends and technologies, including data quality and data governance tools.
  • Experience working in an Agile development environment.

 

 


Forian Inc. is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

 

Job Categories: Data, Data Engineering, Healthcare
Job Types: Full Time
Job Locations: Remote (CT, DE, MA, MD, NJ, NY, PA only)