Systems Philosophy Stack Research Credentials Presence
Data Systems
Applied AI
Istanbul, TR

I build reliable pipelines.
Then I make them intelligent.

From staging layer to final output, I design systems that hold under real conditions. Then I add intelligence where it actually earns its place β€” not the other way around.

Rana Irem Turhan
Systems Engineer Data Scientist | Engineer | AI Practitioner
🏒 EPAM Systems
πŸ† Top 5 β€” AI Datathon
πŸŽ–οΈ SAP Young Professionals Program
πŸŽ–οΈ SAP End-to-End Systems
πŸŽ–οΈ SAP BTP GenAI Certified
πŸŽ“ ICADA 2026 Speaker
🌍 IELTS Academic 7.5 / CEFR C1
πŸŽ™οΈ DS & ML Hub β€” Community Lead & AI Content Creator

Not assignments. Systems.

Each of these represents a full design loop β€” a real problem, a deliberate approach, a trade-off made consciously, and a result that is measured and defended.

Time-Series ML Industrial Data Pipelines
πŸ† Flagship Initiative

Paint Factory Process Prediction

Predictive modeling of sensor behavior in a live industrial production environment β€” Eliar Elektronik Γ— Hacettepe AI Club

The Problem

Predicting critical tank fill levels in a continuous paint production line β€” none of it clean, none of it static. Required handling real-time sensor signals, irregular valve states, and automated dosing commands generated by industrial machinery.

What I Did Differently

Refused naive cross-validation. Designed sequential validation to preserve temporal logic and prevent leakage. Built process-aware features around sequence dependencies β€” industrial signals carry memory. Iterated with LightGBM, focusing on stability under real production conditions, not just leaderboard gains.

21.82 Final MAE Score

Outcome

Reliably outperformed baseline systems. Selected into the top 8 teams nationally. Invited to formally present and defend the architecture in Ankara before an academic and industrial professor jury.

Applied AI Product Discovery Responsible AI Build Hub

FoodVision AI

AI-Powered Nutrition Intelligence Platform | Team Lead

The Problem

People need more understandable, localized, and trustworthy nutrition guidance, especially when food information is fragmented across images, ingredient lists, and cultural food contexts.

What I Did

Leading a multidisciplinary team in MVP definition, user persona development, feature prioritization, product discovery, and AI architecture planning. Contributing to the design of a multi-layer system combining computer vision, LLM-based reasoning, structured food knowledge, explainability, and safety-focused response generation.

MVP Stage

Outcome

Currently in early-stage development, with a validated MVP direction, responsible AI considerations, and a clearer product roadmap for Turkish and international users.

Machine Learning Credit Risk Scoring

FinRisk-AI: Credit Risk Scoring

Developing an intelligent risk assessment architecture to optimize lending decisions.

The Problem

Traditional credit scoring mechanisms often plateau at roughly 60% accuracy, relying heavily on simple linear thresholds. These baseline models fail to capture non-linear financial behaviors, resulting in excessive defaults and lost opportunities.

What I Did Differently

Engineered an ensemble machine learning pipeline incorporating XGBoost and Random Forest, orchestrated through rigorous cross-validation. Deployed the resulting serialized model as an interactive application via Hugging Face Spaces for immediate stakeholder access.

75% Model Accuracy

Outcome

Elevated the baseline measurement from a stagnant 60% to a robust 75%, effectively proving the commercial value of ensemble trees in high-stakes financial risk modeling.

RAG Infrastructure Generative AI

RAG-Based SQL Interpretation System

A retrieval-augmented pipeline executing precise database queries over complex schemas β€” Global AI Hub Γ— Akbank

The Problem

LLMs consistently hallucinate SQL structures when untethered. Without rigorous grounding in a defined schema context, generated analytics queries fail silently, projecting high confidence on corrupted data.

What I Did Differently

Built a FAISS vector retrieval layer mapping to specific normalized entity embeddings. Developed structured, strict prompt templating boundaries to inject context safely, and cut latency via prefix query filtering.

~90% Query Accuracy

Outcome

Engineered and deployed a fully functional MVP within one week, dropping systemic hallucination rates drastically while reducing retrieval latency by 35%.

Data Warehouse Architecture ETL Reliability

Multi-Layer Warehouse Reconstruction

End-to-end analytical architecture mapping raw staging areas through 3NF up to a modeled data mart β€” EPAM Systems

The Problem

Analytical stakeholders were executing demanding queries directly against fragile operational databases. The ecosystem lacked separation of concerns, mutation tracking, or recovery strategies for failed pipelines.

What I Did Differently

Migrated infrastructure to a strict Staging β†’ 3NF β†’ Data Mart paradigm. Enforced Slowly Changing Dimensions (SCD Type 1/2) and built heavily idempotent ETL jobs backed by robust transaction controls.

25% Less Redundancy

Outcome

Successfully decoupled BI layers from operational schema fluctuations. Eliminated silent reporting failures by establishing fully auditable, rerunnable ingestion pipelines.

Analytical Dashboards Statistical Modeling

Climate Dashboard & Insight Explorer

Long-range climate visualization and correlation analytics engine.

The Problem

The bottleneck wasn't the data β€” it was access. Translating decades of raw atmospheric and temperature datasets into immediate, actionable insights for non-technical researchers without losing statistical rigor.

What I Did Differently

Built highly reusable Python data pipelines leveraging Pandas for ingestion and Scikit-learn for correlation modeling. Architected a live frontend employing Dash and Plotly for deep interactive visualization.

Live Analytics Platform

Outcome

Delivered a thesis-style analytics application capable of rendering long-range correlations dynamically via parameter injection from the UI.

How I evaluate systems.

Data engineering should be invisible when functioning properly. The structural decisions made in the foundational ingestion layer irrevocably dictate the integrity of every subsequent analytic.

01. Pipeline as a Contract

Make the promises explicit.

Every ETL workflow makes implicit promises downstream. I make those promises explicit by integrating idempotency, transaction commits, and structured error boundaries at deployment, not post-failure.

02. Architectural Trust

Speed without separation is debt

Raw ingestion, normalized 3NF, and dimensional models aren't bureaucratic fillerβ€”they represent strict ownership boundaries. Collapsing layers merely to accelerate immediate delivery is borrowing heavily against future stability.

03. Grounded Intelligence

AI needs a boring foundation.

Any intelligent model is fundamentally capped by the reliability of its input features. I architect AI capabilities exclusively atop sanitized, well-modeled schemas. Reliable engineering renders intelligence trustworthy.

04. Technical Translation

If it can’t be explained, it won’t be used

Sophisticated analytical work holds zero leverage if misunderstood by decision-makers. I prioritize translating pipeline latency, predictive confidence, and data quality into universally resonated business narratives.

Systems over tools.

Languages and frameworks rotate over time. The structural methodologies remain constant. Below are my core areas of execution.

Enterprise Systems & SAP

Navigating and deploying within massive enterprise resource ecosystems.
Why it matters: Enterprise stacks require strict governance and deep integration knowledge to prevent silos.
SAP Young Professionals Program & Certifications.
SAP S/4HANASAP BTPSAP Analytics Cloud

Data Architecture Strategies

Designing heavily layered environments establishing clear partitions between ingestion, curation, and analytics.
Why it matters: Collapsing layers to accelerate immediate delivery borrows heavily against future stability.
Applied extensively across EPAM integration workflows.
PostgreSQLAmazon RedshiftAWS S3SCD Architecture

Robust ETL & Validation

Writing immutable, idempotent pipelines designed to fail predictably and restart seamlessly.
Why it matters: Silently failing analytics pipelines create organizational mistrust in the fundamental data layers.
Core methodology for stabilizing fragmented upstream data.
PythonSQLAmazon RDSDynamoDB

Applied ML & Vector Systems

Constructing time-series feature environments and RAG layers specifically for production.
Why it matters: Any intelligent model is fundamentally capped by the reliability of its input streams.
Validated via Top 5 National Datathon placement.
LightGBMFAISSHugging FaceEnsembles

The boundaries of my work.

Where research meets production constraints.

ICADA 2026 β€” Accepted Paper & Conference Presenter

ICADA 2026 – International Artificial Intelligence and Data Science Congress

Paper: A Portable Provenance Schema for Auditable Clinical NLP

Presented an applied AI research paper focused on provenance, auditability, and reliability in clinical NLP systems at ICADA 2026.

Data Science & ML Hub

Community Leadership & Education

Recognized as an AI Content Creator and awarded "Rising Voice of the Month". Passionate about translating technically dense conceptsβ€”like ETL mechanics and deployment nuancesβ€”for learners and non-experts.

Enterprise Certifications

SAP & Cloud Ecosystems

Graduate of the SAP Young Professionals Program. Holding verified credentials including SAP Certified Associate – Implementation Consultant and SAP Certified – SAP Generative AI Developer. Profound exposure to end-to-end business models via SAP S/4HANA.

Verified. Documented.

Certifications, test scores, and artifacts that back the work.

Systematic translation.

My foundational reflex is identical regardless of the domain: I must interpret the macro-system before mutating the micro-parts. This instinct guides my data architecture, but it also dictates my work outside the terminal.

I operate extensively in narrative storytelling and voice delivery. While it appears to be separate from engineering, it relies on the same core principle: absorbing complexity and delivering it so the intent is understood without friction.

Explaining systems is not an accessory β€” it is a constraint. If something cannot be made clear without distortion, the design itself is incomplete.

I am drawn to work at the intersection of architecture and execution β€” where understanding the full system matters as much as delivering a clean result.

Whether I’m designing a data pipeline or shaping a narrative, the objective is the same: preserve intent, reduce noise, and make the outcome usable.

Intersecting Domains

  • Narrative Voice Acting
  • Technical Concept Simplification
  • Cross-Disciplinary Team Parsing
  • Public Academic Defense