Roham Chlak

// Hello World, I'm

Roham Chlak

رُهام شلاق

Data professional with hands-on experience in data engineering, data quality, governance, and analytics. Skilled in building scalable ETL pipelines, profiling datasets, and delivering actionable insights across enterprise organizations in the UAE.

work_experience

// career.log

Data Engineer - Manager

Capgemini Invent
Jun 2025 — Jun 2026 Dubai, United Arab Emirates

Leading data engineering initiatives and managing end-to-end data pipelines for enterprise clients. Driving cloud-based data platform modernization and implementing best practices for data architecture and governance across multiple engagement streams.

Data Quality Analyst

General Pension and Social Security Authority (GPSSA)
Jun 2023 — May 2025 Abu Dhabi, United Arab Emirates

  • Developed and published Open Data datasets, along with comprehensive metadata and data dictionaries, to support the TDRA annual assessment
  • Coordinated with internal stakeholders, Bayanat.ae, and the GPSSA website to ensure alignment with national Open Data standards
  • Conducted data profiling and identified data quality issues across multiple data sources to support data governance efforts
  • Integrated and enriched GPSSA datasets by consolidating inputs from various data owners
  • Reconciled and validated data post-migration to ensure consistency and accuracy
  • Analyzed service requests for accuracy and relevance, generating actionable reports for business units
  • Performed statistical analysis to assess and monitor data quality metrics
  • Designed dashboards and visualizations in SAS, and delivered daily and ad-hoc reporting to stakeholders
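As an illustration of the post-migration reconciliation mentioned above, here is a minimal sketch in plain Python. It is not the tooling used at GPSSA; the function names (`row_fingerprint`, `reconcile`), the column names, and the sample records are all hypothetical. It compares a source and a target record set by key and reports rows that are missing, unexpected, or changed.

```python
from __future__ import annotations

import hashlib
from typing import Iterable


def row_fingerprint(row: dict, columns: list[str]) -> str:
    """Hash the selected columns of one row into a stable fingerprint."""
    joined = "|".join(str(row.get(col, "")).strip() for col in columns)
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()


def reconcile(source: Iterable[dict], target: Iterable[dict],
              key: str, columns: list[str]) -> dict:
    """Compare two record sets by key: missing, unexpected, and changed rows."""
    src = {row[key]: row_fingerprint(row, columns) for row in source}
    tgt = {row[key]: row_fingerprint(row, columns) for row in target}
    shared = src.keys() & tgt.keys()
    return {
        "missing_in_target": sorted(src.keys() - tgt.keys()),
        "unexpected_in_target": sorted(tgt.keys() - src.keys()),
        "mismatched": sorted(k for k in shared if src[k] != tgt[k]),
    }


# Hypothetical sample records standing in for a legacy and a migrated table.
legacy = [
    {"id": 1, "name": "A", "status": "active"},
    {"id": 2, "name": "B", "status": "closed"},
    {"id": 3, "name": "C", "status": "active"},
]
migrated = [
    {"id": 1, "name": "A", "status": "active"},
    {"id": 2, "name": "B", "status": "active"},
    {"id": 4, "name": "D", "status": "active"},
]

report = reconcile(legacy, migrated, key="id", columns=["name", "status"])
print(report)
# {'missing_in_target': [3], 'unexpected_in_target': [4], 'mismatched': [2]}
```

Hashing each row keeps the comparison cheap when only keys and fingerprints need to be held in memory. On real tables the same idea would more likely be expressed as SQL or a dataframe join.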

Data Scientist & Developer

Internal Security Forces (ISF)
2013 — 2022 Lebanon

Python coding and scripting; machine learning algorithms; Spark (PySpark); Hive (on top of Hadoop); Elasticsearch and Kibana; Siren (graph); NiFi (data transformation and processing, also used for ETL).

Engineering Design & Manufacturing Developer

Engineering Design & Manufacturing (EDM)
Jan 2011 — Dec 2012 Lebanon

Delphi Forms and Pascal development on the company's in-house EDM system for business solutions (POS, stock, and accounting systems).

Developer

Allied Computer Incorporation (ACI)
2010 — 2011 Lebanon

Delphi Forms and Pascal development on the ARAM accounting system (Saudi version).

Designer & Publisher

Lebanese Preparatory School (LPS)
2009 Lebanon

Designed school agendas using Adobe Illustrator. Published computer knowledge books for grades 1 to 12 after their translation from English to French.

education

// degrees.json

BSc in Computer Engineering (CENG)

LIU University

2019 — 2022 Beirut, Lebanon

Focused on computer engineering fundamentals, software development, networking, and systems design.

BSc in Computer Engineering

CNAM University

2010 Lebanon

Foundation in computer engineering, programming, algorithms, and data structures.

services

// what_i_do.ts

Data Engineering

Python scripting and PySpark for processing and cleansing data from varied sources (CSV, Excel, Access, MySQL, SQL Server, Oracle, etc.) and loading it into the big data ecosystem (Hadoop HDFS and Elasticsearch).
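A cleansing step of the kind described above might look like the following sketch. To stay self-contained it uses only the Python standard library rather than PySpark; the `cleanse` function, the `id` key column, and the sample CSV extract are hypothetical and chosen only for illustration.

```python
import csv
import io


def cleanse(rows, key="id"):
    """Trim whitespace, drop rows with an empty key, and de-duplicate by key."""
    seen = set()
    for row in rows:
        # Normalize header and value whitespace; ignore overflow columns.
        cleaned = {k.strip(): (v or "").strip()
                   for k, v in row.items() if k is not None}
        if not cleaned.get(key):
            continue  # skip rows where the key value is missing
        if cleaned[key] in seen:
            continue  # keep only the first occurrence of each key
        seen.add(cleaned[key])
        yield cleaned


# Hypothetical CSV extract with stray spaces, a missing id, and a duplicate.
raw_csv = """id,name,city
1, Alice ,Dubai
,Bob,Abu Dhabi
2,Carol, Beirut
1,Alice,Dubai
"""

records = list(cleanse(csv.DictReader(io.StringIO(raw_csv))))
for record in records:
    print(record)
# {'id': '1', 'name': 'Alice', 'city': 'Dubai'}
# {'id': '2', 'name': 'Carol', 'city': 'Beirut'}
```

Because `cleanse` is a generator over dictionaries, the same function can sit behind any reader that yields rows as dicts, which is one way to handle several source formats with a single cleansing rule set.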

Web Development

C# MVC 5, Django, WordPress, Angular, PHP & MySQL solutions.

Data Science / AI

Machine Learning and Deep Learning models. Statistical analysis using R and Python. NLP and predictive analytics.

Windows Applications

Java development. Delphi Forms / Pascal programming for business solutions (POS, stock, accounting).

Training

Python course training. PySpark for big data processing. Hadoop ecosystem and pipeline design workshops.

Trainings & Professional Development

// certifications.log

OSINT & Intelligence Gathering

Softech Beirut, Lebanon
December 2020
  • Training on OSINT methodologies and investigative tools
  • Hands-on use of PeopleMon and OSIntMon platforms

Cybersecurity Awareness & Threat Mitigation

Ministry of Administrative Reform (OMSAR) Beirut, Lebanon
July 2020
  • Cybersecurity threat awareness and attack mitigation practices
  • Security best practices and organizational risk reduction strategies

Big Data Architecture & Data Processing

Cognitus SAS Dubai, UAE
February 2019
  • Python scripting and data processing automation
  • Distributed and scalable Big Data architecture design
  • Apache NiFi for data flow, transformation, and processing
  • Technical coordination with development teams and stakeholders

Machine Learning & Big Data Engineering

Cognitus SAS France
November 2018
  • Big Data processing using SQL, Apache Spark (PySpark), Hive, and Elasticsearch/Kibana
  • Design and implementation of Big Data architectures
  • Performance optimization of legacy systems
  • Machine Learning and Deep Learning model training/testing

Statistical Data Analysis with R

Cognitus SAS
July 2018
  • Statistical analysis and data modeling using R and RStudio

Hadoop Ecosystem & Big Data Solutions

Cognitus SAS Beirut, Lebanon
January 2018
  • Big Data ecosystem architecture and deployment
  • Hadoop ecosystem for storage, streaming, processing, and visualization

Oracle Database & Exadata Storage Management

New Horizons Dubai, UAE
July 2017
  • Oracle 12c and Exadata Storage Management

tech_stack

// skills.config
Analytics
SAS, Statistical Analysis
Big Data
Apache Spark, Elasticsearch, Hadoop / HDFS, Hive, Kibana, NiFi, PySpark, SSIS, Siren Graph
Cloud
AWS Glue, Azure Data Factory, Azure Databricks, Azure Synapse
Data Science
Deep Learning, Machine Learning, NLP
Database
MS SQL Server, MySQL, Oracle 12c / EXADATA, Oracle Database, Oracle Forms & Reports, PostgreSQL
Domain
Data Governance, Data Management, Open Data Standards
Engineering
Data Migration, Data Warehousing, ETL Pipelines
Programming
Angular, C# MVC5, Delphi / Pascal, Java, NumPy, PHP, Pandas, Python, R, SQL
Quality
Data Cleansing, Data Profiling
Tools
Adobe Illustrator, OSINT Tools, R Studio
Visualization
Power BI, SAS Visual Analytics, Tableau
Web Development
Django, WordPress