Kyle Naranjo

I build data and AI infrastructure for enterprise clients and startups across financial services, investment management, education, and compliance.

About

My work spans data pipelines, agentic AI, data warehousing, data modeling, MLOps, backend software engineering, and cloud infrastructure across multiple engagements.

I also speak about generative AI, AI coding agents, and modern engineering workflows through technical talks and community events.

Data EngineeringAI InfrastructureGenAIAgentic AIMLOpsData PipelinesData WarehousingBackend EngineeringCloud Infrastructure

Professional Experience

Thinking Machines Data Science

Data Engineer II | Aug 2024 - Present

thinkingmachin.es

A large Singaporean investment holding company

Investment data platform

  • Owned backend feature development across more than 10 interconnected microservices powering deal discovery, due diligence, and portfolio monitoring workflows.
  • Built backend services and data pipelines with FastAPI, Dagster, Kubernetes, and Snowflake, processing terabytes of data from vendor platforms including Sustainalytics, PitchBook, Bloomberg, and MSCI.
  • Improved platform reliability and observability through production monitoring workflows using Kibana and Grafana.
FastAPIDagsterSnowflakePostgreSQLSQLAlchemyPydanticKubernetesKibanaGrafanaJenkinsBitbucket

Enterprise document intelligence platform

  • Iterated on Snowflake Cortex AI agent workflows with the Knowledge Graph team so investment officers could extract data from long-form documents, run contextual queries, and analyze portfolio information at scale.
  • Owned agent tooling, prompt engineering, evaluation, workflow tuning, and production incident triage across the document intelligence stack.
  • Partnered with a roughly 15-person cross-functional team spanning frontend, backend, LLM workflows, MLOps, DevOps, and QA.
FastAPISnowflakeSnowflake CortexOpenAI APIKubernetesKibanaGrafanaAzure DevOps

A major Philippine bank

Enterprise data products

  • Led the data products workstream in a year-long engagement, partnering with C-suite leaders, directors, and business units to define and ship priority data products on Azure Databricks.
  • Designed and productionized a daily Single Customer View pipeline that consolidated roughly 15 million records from four enterprise systems into about 7 million unique customer keys used across more than 10 business units.
  • Built a probabilistic record-linkage engine with Splink, surfaced 748,000 candidate duplicate pairs missed by exact matching, and designed a five-tier confidence framework validated at more than 99.9 percent accuracy.
  • Cut Single Customer View runtime from more than four hours to under one hour and co-designed a Next Best Product recommendation pipeline covering 12 product types and four customer segments.
Azure DatabricksPySparkSQLSplinkDelta Lake

A Philippine education enterprise

Student-at-risk prediction platform

  • Spearheaded the infrastructure and ingestion workstream for a student-at-risk prediction platform on Azure Databricks.
  • Provisioned and managed platform infrastructure with Terraform, built API ingestion pipelines across three source systems, and migrated Excel-based linear regression models into production Databricks jobs.
  • Accelerated team delivery by integrating AI coding agents into the development workflow using custom agent skills, hooks, and command layers.
Azure DatabricksTerraformPythonSQL

A large Singaporean enterprise

Data ingestion and transformation

  • Led the ingestion and transformation workstream, building end-to-end pipelines from four source systems in SQL Server and MongoDB into BigQuery.
  • Designed surrogate key strategies to resolve identifier collisions between member and visitor systems and modeled production reporting tables for attendance analytics.
  • Resolved memory failures in legacy Airflow DAGs by implementing chunked CSV processing before production rollout.
BigQueryCloud ComposerAirflowDataformSQLMongoDBSQL Server

A Philippine airline

Enablement curriculum design

  • Designed and delivered a six-course enablement curriculum covering Python and Power BI tracks from beginner to advanced levels in four days.
PythonPower BICurriculum Design

Internal Contributions

  • Authored a standardized data ingestion scoping framework RFC for the engineering consulting team.
  • Created post-exam study guides that supported certification preparation across the company.
  • Built a proof-of-concept integration connecting ChatGPT to Databricks through an MCP server.

Ingenuity Software

Backend Software Engineer | Jan 2022 - Jul 2024

Built the backend of a compliance case-management platform on Django, deployed across two AWS regions with separate databases for data residency and GDPR requirements. Decomposed the monolith into cross-region services and shipped roughly 50 production API endpoints supporting more than 100 reports per day.

Software Engineer (Part-time) | Jan 2022 - Sep 2022

Built a Google Cloud data processing pipeline for a COVID-19 contact tracing and vaccination tracking application and led a two-week data science internship program for eight participants.

UP Diliman EEEEI

Research Associate | Aug 2022 - Nov 2023

Co-authored and published the IEEE AIComprehend paper, then built and deployed the full-stack Django application used in a month-long controlled study where students improved their test scores by 13.9 percent.

GCash

Technology and Operations Intern | Jun 2022 - Aug 2022

Modeled and transformed production datasets into Looker Studio dashboards used by the Technology and Operations team to track a 300-plus participant developer program.

Technical Skills

Languages

PythonSQLJavaScriptJavaC++Bash

Data Engineering

DatabricksSnowflakeBigQueryPostgreSQLMongoDBPySparkPandasAirflowDagsterdbtDataformAzure Data FactoryAWS GlueDataflow

ML and AI

OpenAI APIVertex AIAzure AI ServicesSnowflake CortexLangChainKerasTensorFlowSplinkLLM EvalsRAGEmbeddingsDocument ExtractionComputer VisionNLPAI AgentsMLOps

Cloud and Infrastructure

AWSAzureGCPDockerTerraformKubernetesCI/CDIAMNetworking

Backend Engineering

FastAPIDjangoFlaskSQLAlchemyPydantic

Development Tools

CursorClaude CodeCodex CLIGitHubAzure DevOpsGitBitbucketJiraConfluencePostmanJenkins

Certifications

AI Technical Practitioner

OpenAI

Mar 2026

Professional Machine Learning Engineer

Google Cloud

Jul 2025 - Jul 2027

Databricks Certified Data Engineer Associate

Databricks

Dec 2025 - Dec 2027

Azure AI Engineer Associate

Microsoft

May 2025 - May 2026

Generative AI Leader

Google Cloud

Oct 2025 - Oct 2028

AI/ML Pre-Sales Technical Expert

Google Cloud

Dec 2025

Airflow 3 Fundamentals

Astronomer

Jul 2025

AWS Certified Cloud Practitioner

Amazon Web Services

Mar 2024 - Mar 2027

Azure Fundamentals

Microsoft

Jun 2024

Cloud Digital Leader

Google Cloud

Apr 2024 - Apr 2027

Community Engagements

Google Developer Group Davao

Community Lead / L&D Lead | Jul 2022 - Present

Led the organizing team behind more than 10 in-person events over 2.5 years, bringing together 2,000-plus participants, around 25 speakers, around 25 partners and sponsors, and more than 50 volunteers. Helped grow the community from zero to 500-plus official members.

Global Shapers Community

Shaper, Davao Hub | Jan 2026 - Present

Member of the World Economic Forum-backed network of young leaders driving local impact through community projects.

Speaking

Conference Talk

Geeks On A Beach AI Show and Tell

Spoke about configuring AI coding assistants with subagents, agent skills, and AGENTS.md.

Startup Community Event

Maximizing Your AI Dream Team

Talk on working with AI agents and modern engineering workflows.

Community Talk

AWS User Group Davao

Invited speaker.

Conference Presentation | Doha, Qatar

ISNCC 2023

Presented the AIComprehend research paper.

Education

University of the Philippines Diliman

BS Computer Engineering | 2019 - Jul 2024

  • Graduated summa cum laude with a 1.15 weighted average and finished in the Top 5 of the program.
  • DOST-SEI Merit Scholar.
  • Active in Google Developer Student Clubs UP Diliman and the UP Data Science Society.

Philippine Science High School - Southern Mindanao Campus

High School Diploma, STEM | Jun 2013 - May 2019

  • Graduated with High Honors.
  • Received the DOST Youth Excellence in Science Medal and the Excellence Award in Computer Science.

Notable Achievements

IEEE | Published Nov 27, 2023

AIComprehend: An Adaptive Reading Comprehension Learning Platform Using Machine Learning

AIComprehend is a web-based application designed to enhance English reading comprehension through adaptive multiple-choice questions calibrated to varying difficulty levels. A four-week controlled study with 58 high school students showed a 13.9 percent improvement in test scores for the experimental group.

View IEEE paper

University of the Philippines Diliman | Jul 2024

Summa Cum Laude

Graduated summa cum laude in BS Computer Engineering and finished in the Top 5 of the program.

Apache Software Foundation | 2025

Apache Airflow Open Source Contributor

Contributed documentation updates covering Azure Blob Storage remote logging and Google Cloud Vertex AI operators.