CV
Below is my CV that I rarely update. Publication include on-going task. Newest pdf version available upon request.
Basics
Name | Eryawan Presma Yulianrifat |
Label | Build AI & High Perf Computing | 2nd National Data Mining | Silver Medalist IDN NOI | ICPC Regional Finalist |
zazaneryawan@gmail.com | |
Url | https://linkedin.com/in/eryawan-presma-yulianrifat |
Summary | Computer Science student with research interests in Language Models, Reinforcement Learning, and makes Machine Learn. Active in research, competitions, and AI development. Strong background in competitive programming and data science, boasting 4 years of 30-ist competition with significant Regional & National scale achievements. |
Education
-
2022.07 - 2025.12 Indonesia
Bachelor's Degree
Universitas Indonesia
Computer Science
- Data Structure & Algorithm, Algorithm analysis
- Machine Reinforcement Learning, Deep Learning
- Statistic and Probability, Information Retrieval
- Linear Algebra, Calculus, Discrete Mathematics
Publications
-
2025.01.01 IWSLT-2025 Low-Resource Languages Speech Translation Technology: Maltese to English
Developed a Maltese-to-English speech translation system. Fine-tuned OpenAI's Whisper V3 Large and implemented joint learning. Utilized QLoRA to optimize performance while maintaining computational efficiency.
-
2025.01.01 SemEval-2025 Task 11: Evaluating State-of-the-Art Encoder for Multi-Label Emotion Detection
Conducted a systematic comparison of current state-of-the-art models for multilingual multi-label emotion detection, evaluating various encoders and training strategies. Highlighted the superiority of prompt-based encoders like BGE and mE5 combined with tree-based models over fully fine-tuned transformers. [On Submission]
-
2025.01.01 Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
Open-source dataset creation for SEA languages, contributing 300 data points. On submission to ACL 2025.
-
2024.01.01 ChatGPT Assistance on Biochemistry Learning Outcomes of Pre-Service Teachers
Performed A/B testing to find out whether there is a statistically significant difference between accuracy and time taken for ChatGPT-assisted teaching versus non-ChatGPT-assisted teaching. Paper will be published soon.
-
2024.01.01 Investigating the Challenges of First-Year Education Students in Basic Physics I Practicum
Performed A/B testing and experiment design to analyze the challenges of lab practicum sessions. Accepted in Journal of Research and Physics Education Studies (JRKPF).
-
2024.01.01 Beyond the Gap: Understanding Gender Disparities in Indonesia’s National Science Olympiad Achievements
Conducted a mixed-method research investigating gender disparities in Indonesia's National Science Olympiad (OSN) achievements. The research found significant male dominance across subjects and explored the impact of social stereotypes and biological factors on female performance. Accepted in Daengku, Journal of Humanities and Social Sciences Innovation 2024.
-
2024.01.01 Exploring Indonesian Consumer Health Question Answering using Active Multi Retrieval
Optimizing integration of RAG and LLM in Indonesian language assisted by ColBERT retrieval to reduce hallucination and enhance confidence using an Active Multi Retrieval System.
Awards
- 2024.09.01
2nd Winner - Gemastik Data Mining
Ministry of Education, Culture, Research, and Technology
Advanced through preliminary rounds with a research paper titled 'Exploring Indonesiaon Consumer Health Question Answering utilizing Active Multi Retrieval', dominated the leaderboard in ordinal regression for university tuition cost optimization.
- 2021
Silver Medal - Indonesian National Olympiad in Informatics
Ministry of Education, Culture, Research, and Technology
Secured 8th place in Indonesia's biggest high school competition, earning a direct admission to Universitas Indonesia without any test.
- 2024.04.01
Finalist - Smart Logistics Datathon
The Chinese University of Hong Kong
Competed against Asia's best universities in a 36-hour datathon addressing Hong Kong logistics challenges with theoretical and computational solutions.
- 2023.11.01
Rank 15 - ICPC Regional Asia Jakarta
International Collegiate Programming Contest (ICPC)
Solved 13 algorithmic and mathematical problems under time pressure representing university-level programming excellence.
- 2023.11.01
Finalist - ICPC Regional Asia Jakarta
International Collegiate Programming Contest (ICPC)
Competed in world-level algorithmic and mathematical problems in one of Asia's most prestigious programming competitions.
- 2023.10.01
1st Winner - InnovationQuest
Airlangga University
Developed 'Road Damage Level Detection' using multimodal text and image fusion, combining YOLO, ResNet, BERT, and CatBoost.
- 2023.10.01
3rd Winner - Dataquest
Airlangga University
Developed time series rainfall prediction and solved an OCR task related to medicine, securing 2nd in prelims and 3rd in finals.
- 2023.09.01
Semi-finalist - Satria Data Competition
Ministry of Education, Culture, Research, and Technology
Built Indonesian NLI models trained on translated datasets using distributed cloud training to predict legal harmony in constitution documents.
- 2023.02.01
2nd Winner - Datavidia
Bandung Institute of Technology
Developed a hybrid FCN-LSTM rainfall prediction model and designed an efficient earthquake early warning system.
- 2021.12.01
3rd Winner - ILPC (Informatics Logical Programming Competition)
UBAYA University
Excelled in logic and discrete mathematics challenges and teamwork-based coding games in final stage.
- 2021.09.01
5th Winner - Informatics Olympiad (IO)
Jember University
Solved theoretical computer science problems covering discrete mathematics, logic, and graph theory.
Skills
Data Science | |
Machine Learning | |
Deep Learning | |
Time Series Forecasting | |
Data Mining, Analysis, and Visualization | |
Applied Statistics : Hypothesis & A/B Testing |
Competitive Programming, Algorithm & Data Structure | |
C++ | |
Time & Memory Analysis | |
Discrete Mathematics Analysis | |
Algorithm Design & Proof | |
Advanced Data Structure such as Lazy Segmentree, Sparse Table, and LCA |
Natural Language Processing | |
Large Language Models (LLM) | |
Information Retrieval | |
Generative Language Models | |
LM and RL from Human Feedback (RLHF) and Verifiable Reward (RLVR) | |
transformers library |
Reinforcement Learning | |
Hacking Stable Baselines3, gymnasium, and tianshou | |
Classic Offline Dynamic Programming RL | |
Policy Gradient Methods (DQN, TPRO, PPO, GPRO) |
Computer Vision | |
Image Processing (Filtering, Transformation, Segmentation) | |
Object Detection & Image Classification | |
Optical Character Recognition (OCR) |
Programming, Systems & Infrastructure | |
Python, Type Script & Java Script | |
Linux & Shell Scripting | |
High Performance Computing on cloud (HPC) | |
Distributed Training |
Mathematics | |
Discrete Mathematics & Proofs | |
Calculus & Vector Calculus | |
Linear Algebra | |
Statistics, Probability, and Information Theory | |
Stochastic Process, Markov Decision Process, and Game Theory |
References
Alfan Farizki Wicaksono, S.T., M.Sc., Ph.D. | |
Direct Supervisor. Research (Faculty - Asst Prof & Above), NLP Researcher Lab & Head of Computer Science Major at Universitas Indonesia |
Prof. Haryadi Gunawi | |
Research Mentor. Professor at the Department of Computer Science, University of Chicago |
Projects
- 2023.01 - 2023.12
Contextual Modeling: Racism in Text Identification
Natural language processing model that predicts whether a given reply to a given context is racist. Research is mainly focused on developing a model that could understand context. Results show our technique could use context in some cases using a certain representation.
- 2023.01 - 2023.12
Optical Character Recognition for Medicine
Computer Vision approach to obtain medicine’s name from an arbitrary perspective of a medicine. Utilizing image processing to create a deep learning model pipeline that generalizes better over perspective, rotation, warping, and color to adapt better on new image data points.
- 2023.01 - 2023.12
Indo NLI XLMR Base Fine-tune
Natural Language Inference (NLI) entails determining if two statements contradict, are neutral, or entail each other. The XLMR Base model was fine-tuned using a vast dataset of 4 million translated texts. To optimize training on this large dataset, employing multiple techniques, including data parallelization, gradient accumulation, gradient checkpointing, and NVIDIA mixed precision is a must. Trained on 2 RTX 6000 GPUs on cloud computer. Outperforms others on key datasets.
- 2022.01 - 2023.12
Personal Stock Analytics
Using LSTM to analyze price changes. Combine with sentiment analytics scraped from social media. The model predicts a confidence interval. When the condition is not predictable, the predicted range will be really big. Not deployable model because of lack of generalization and predictive features.
- 2022.01 - 2023.12
Instagram Scraper Module
Robust Multiprocessing module to scrape massive amounts of Instagram data. Scrape post, profile picture, following, follower, and create a graph representation of a friendship. Focus mainly on finding hidden API and bypassing any required rendering in order to achieve full speed.
Volunteer
-
2024.06 - Present Data Scientist
Data Science Initiative, Youth Catalyst Foundation
Working directly under CTO to solve business problems by translating them into technical data objectives. Built a complete data infrastructure campaign using Mixpanel. Developed recommendation systems and a RAG chatbot for suggesting competitions, mentors, internships, scholarships, volunteer opportunities, and events.
-
2023.08 - Present Expert
Kaggle
Actively participated in discussions and published notebooks that garnered upvotes, earning a place in the top 3,300 out of 325,000 entries. Participated in 25 competitions, including a public playground competition ranked 23rd out of 934.
-
2022.12 - Present Member
Ikatan Alumni Tim Olimpiade Komputer Indonesia
Prestigious Indonesian olympiad winner organization regularly holding task force for national olympiad, national training, and an annual meetup connecting Indonesia's top computer science talents.
-
2022.07 - 2022.11 Head of Organizers
RISTEK x Datacamp Scholarship
Managed the distribution of 50 Datacamp premium scholarships and ensured a high-quality learning experience.
-
2022.06 - 2022.11 Teaching Assistant
RISTEK Sister in Tech
Mentored women in tech, provided materials and project tasks to enhance their portfolios.
-
2022.04 - Present Lead of Data Science & Analytics
RISTEK
Led the Data Science & Analytics team, organized Indonesia’s first Ristek Datathon, and developed programs supporting AI transformation in Indonesia.
-
2022.02 - 2022.11 Expert Staff of Data Science Academy
COMPFEST
Developed a 12-day data science curriculum for 12 speakers during the bootcamp. Successfully invited the CEO of a leading AI company in Indonesia.
Languages
Indonesian | |
Native Speaker |
English | |
Fluent |