Hi, I am Oscar

Oscar CHEN

Software Engineer at FLYWHEEL Inc.

I am a passionate software engineer with 4 years of work experience, currently aiming to grow as a Data Engineer and DevOps Engineer.

Experiences

Software Engineer
FLYWHEEL Inc.

Nov 2023 - Present, Tokyo, Japan

Growth By Data.

Responsibilities:
  • Developed a standalone playground environment for dbt development using shell scripts and Jinja templates, enabling faster development of dbt models. Enhanced system reliability by supporting integration tests for 40+ complex business logic pipelines.
  • Built and maintained workflows using open-source tools (e.g. dbt-osmosis, sqlfluff, reviewdog) to streamline local and GitHub development, freeing developers from routine tasks. Refactored a GitHub Actions workflow, reducing execution time by 75%.
  • Partnered with the app team to design and implement a pipeline integrating app event data with internal datasets for a client's billing system. Resolved data latency issues across different data sources, ensuring accurate revenue tracking and reducing processing errors. Architected and deployed 200+ robust and easily extendable data pipeline jobs and sensors in Dagster, aligned with complex data dependencies. Led the design and implementation of encryption/decryption for client-sensitive data using a TDD strategy, seamlessly integrating the feature into existing Dagster workflows.
  • Contributed extensively to development logs and documentation, streamlining the team's onboarding process. Created clear diagrams in shared documentation to simplify complex logic, enhancing knowledge transfer and improving overall software maintainability.
  • Optimized CI/CD pipelines by migrating the Python linter and formatter, improving execution speed in both CI workflows and VS Code. Implemented ruff to replace slower tools, reducing linter execution time by 90% and improving developer efficiency.
  • Mentored junior engineers on best practices for dbt and Dagster development. Owned infrastructure automation and stability improvements, implementing Terraform-based deployments (e.g. AWS, Snowflake) to reduce manual configuration effort.

Software Engineer
Tenchijin Inc.

Oct 2020 - Oct 2023, Tokyo, Japan

We endeavor to optimize land use activity for the best of the human

Responsibilities:
  • Led a 3-member team to architect and develop an internal data catalog using Golang, GraphQL and MongoDB, increasing data usage inside the company by 30%. Collaborated with a frontend engineer to create a user-friendly UI. Mentored an intern on programming methodologies and best practices for API development.
  • Initiated, developed and maintained more than 20 ETL pipelines with Airflow and GDAL. Automated the ingestion of the latest satellite data into an S3 data lake and transformed it into formats that fit the needs of the data science team.
  • Spearheaded the creation of feature design documents within the engineering team, enhancing company members' understanding of new features and establishing efficient development practices.
  • Increased the number of users by 30% by developing data models for the backend system and devising RESTful APIs for new features, such as Memo/Project on Compass Map, leveraging Django, Postgres, Redis, Prometheus, Grafana, and the AWS stack (including EC2, S3, RDS, Lambda, and Route 53).
  • Proactively dockerized several projects inside the company, improving scalability and deployment, and cut regular deployment time by 30% with GitHub Actions.
  • Developed a machine learning system for continuous model evaluation and model deployment through API services. Collaborated closely with the data scientist, automating repetitive procedures by integrating their preprocessing steps and algorithms into the pipeline system.
  • Contributed to clients' success by providing insights from satellite and weather data analysis on optimal areas for agricultural products or potential construction sites, supporting their decision-making.

Education

Master in Spatial Information Science
B.Sc. in Control Science And Engineering

Projects

Master Research
Author September 2018 - June 2020

Designed and implemented the preprocessing procedure for trajectory data. Used machine learning models (e.g. LightGBM, XGBoost, LSTM) to analyze mobility patterns and to identify when people are likely to be at stations. Compared the performance of traditional statistical methods with modern machine learning methods on long-term time series prediction.

2019 KDD Cup Competition Task1
Contributor April 2019 - June 2019

Participated in a big data competition focused on predicting user choices for Baidu Map navigation recommendations. I was responsible for cleaning and exploring large datasets, conducting feature engineering, and building a baseline model using LightGBM. Ranked 143 out of 789 participants. The competition strengthened my ability to solve real-world problems and deepened my knowledge of data analysis and modeling.