CV
Below is my CV, which I rarely update. The publications listed include ongoing work. The newest PDF version is available upon request.
Basics
| Name | Eryawan Presma Yulianrifat | 
| Label | Build AI & High Perf Computing | 2nd National Data Mining | Silver Medalist IDN NOI | ICPC Regional Finalist | 
| Email | zazaneryawan@gmail.com |
| Url | https://linkedin.com/in/eryawan-presma-yulianrifat | 
| Summary | Computer Science student with research interests in Language Models, Reinforcement Learning, and making machines learn. Active in research, competitions, and AI development. Strong background in competitive programming and data science, with 4 years and around 30 competitions, including significant regional and national achievements. |
Education
-  2022.07 - 2025.12, Indonesia. Bachelor's Degree, Universitas Indonesia, Computer Science. Courses:
  - Data Structures & Algorithms, Algorithm Analysis
  - Machine Learning, Reinforcement Learning, Deep Learning
  - Statistics and Probability, Information Retrieval
  - Linear Algebra, Calculus, Discrete Mathematics
 
Publications
-  2025.01.01 IWSLT-2025 Low-Resource Languages Speech Translation Technology: Maltese to English. Developed a Maltese-to-English speech translation system. Fine-tuned OpenAI's Whisper V3 Large and implemented joint learning. Utilized QLoRA to optimize performance while maintaining computational efficiency.
-  2025.01.01 SemEval-2025 Task 11: Evaluating State-of-the-Art Encoder for Multi-Label Emotion Detection. Conducted a systematic comparison of current state-of-the-art models for multilingual multi-label emotion detection, evaluating various encoders and training strategies. Highlighted the superiority of prompt-based encoders like BGE and mE5 combined with tree-based models over fully fine-tuned transformers. [Under submission]
-  2025.01.01 Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia. Open-source dataset creation for SEA languages; contributed 300 data points. Under submission to ACL 2025.
-  2024.01.01 ChatGPT Assistance on Biochemistry Learning Outcomes of Pre-Service Teachers. Performed A/B testing to determine whether there is a statistically significant difference in accuracy and time taken between ChatGPT-assisted and non-ChatGPT-assisted teaching. The paper will be published soon.
-  2024.01.01 Investigating the Challenges of First-Year Education Students in Basic Physics I Practicum. Performed A/B testing and experiment design to analyze the challenges of lab practicum sessions. Accepted in the Journal of Research and Physics Education Studies (JRKPF).
-  2024.01.01 Beyond the Gap: Understanding Gender Disparities in Indonesia’s National Science Olympiad Achievements. Conducted mixed-methods research investigating gender disparities in Indonesia's National Science Olympiad (OSN) achievements. The research found significant male dominance across subjects and explored the impact of social stereotypes and biological factors on female performance. Accepted in Daengku, Journal of Humanities and Social Sciences Innovation 2024.
-  2024.01.01 Exploring Indonesian Consumer Health Question Answering using Active Multi Retrieval. Optimized the integration of RAG and LLMs for the Indonesian language, assisted by ColBERT retrieval, to reduce hallucination and improve confidence using an Active Multi Retrieval system.
Awards
-  2024.09.01 2nd Winner - Gemastik Data Mining (Ministry of Education, Culture, Research, and Technology). Advanced through the preliminary rounds with a research paper titled 'Exploring Indonesian Consumer Health Question Answering using Active Multi Retrieval', and dominated the leaderboard in the ordinal regression task for university tuition cost optimization.
-  2021 Silver Medal - Indonesian National Olympiad in Informatics (Ministry of Education, Culture, Research, and Technology). Secured 8th place in Indonesia's biggest high school competition, earning direct admission to Universitas Indonesia without an entrance test.
-  2024.04.01 Finalist - Smart Logistics Datathon (The Chinese University of Hong Kong). Competed against Asia's best universities in a 36-hour datathon addressing Hong Kong logistics challenges with theoretical and computational solutions.
-  2023.11.01 Rank 15 - ICPC Regional Asia Jakarta (International Collegiate Programming Contest, ICPC). Solved 13 algorithmic and mathematical problems under time pressure, demonstrating university-level programming excellence.
-  2023.11.01 Finalist - ICPC Regional Asia Jakarta (International Collegiate Programming Contest, ICPC). Competed in world-level algorithmic and mathematical problems in one of Asia's most prestigious programming competitions.
-  2023.10.01 1st Winner - InnovationQuest (Airlangga University). Developed 'Road Damage Level Detection' using multimodal text and image fusion, combining YOLO, ResNet, BERT, and CatBoost.
-  2023.10.01 3rd Winner - Dataquest (Airlangga University). Developed time series rainfall prediction and solved an OCR task related to medicine, securing 2nd in prelims and 3rd in finals.
-  2023.09.01 Semi-finalist - Satria Data Competition (Ministry of Education, Culture, Research, and Technology). Built Indonesian NLI models trained on translated datasets using distributed cloud training to predict legal harmony in constitution documents.
-  2023.02.01 2nd Winner - Datavidia (Bandung Institute of Technology). Developed a hybrid FCN-LSTM rainfall prediction model and designed an efficient earthquake early warning system.
-  2021.12.01 3rd Winner - ILPC (Informatics Logical Programming Competition) (UBAYA University). Excelled in logic and discrete mathematics challenges and in teamwork-based coding games in the final stage.
-  2021.09.01 5th Winner - Informatics Olympiad (IO) (Jember University). Solved theoretical computer science problems covering discrete mathematics, logic, and graph theory.
Skills
| Data Science | Machine Learning; Deep Learning; Time Series Forecasting; Data Mining, Analysis, and Visualization; Applied Statistics: Hypothesis & A/B Testing |
| Competitive Programming, Algorithm & Data Structure | C++; Time & Memory Analysis; Discrete Mathematics Analysis; Algorithm Design & Proof; Advanced Data Structures such as Lazy Segment Tree, Sparse Table, and LCA |
| Natural Language Processing | Large Language Models (LLM); Information Retrieval; Generative Language Models; LM and RL from Human Feedback (RLHF) and Verifiable Reward (RLVR); transformers library |
| Reinforcement Learning | Hacking Stable Baselines3, gymnasium, and tianshou; Classic Offline Dynamic Programming RL; Value-Based and Policy Gradient Methods (DQN, TRPO, PPO, GRPO) |
| Computer Vision | Image Processing (Filtering, Transformation, Segmentation); Object Detection & Image Classification; Optical Character Recognition (OCR) |
| Programming, Systems & Infrastructure | Python, TypeScript & JavaScript; Linux & Shell Scripting; High Performance Computing (HPC) on cloud; Distributed Training |
| Mathematics | Discrete Mathematics & Proofs; Calculus & Vector Calculus; Linear Algebra; Statistics, Probability, and Information Theory; Stochastic Processes, Markov Decision Processes, and Game Theory |
References
| Alfan Farizki Wicaksono, S.T., M.Sc., Ph.D. | Direct Supervisor. Research (Faculty - Asst Prof & Above), NLP Researcher Lab & Head of Computer Science Major at Universitas Indonesia |
| Prof. Haryadi Gunawi | Research Mentor. Professor at the Department of Computer Science, University of Chicago |
Projects
-  2023.01 - 2023.12 Contextual Modeling: Racism in Text Identification. A natural language processing model that predicts whether a given reply to a given context is racist. The research focuses mainly on developing a model that can understand context. Results show that our technique can make use of context in some cases, given a certain representation.
-  2023.01 - 2023.12 Optical Character Recognition for Medicine. A computer vision approach to extract a medicine’s name from an image taken at an arbitrary perspective. Uses image processing to build a deep learning pipeline that generalizes better over perspective, rotation, warping, and color, so that it adapts better to new images.
-  2023.01 - 2023.12 Indo NLI XLMR Base Fine-tune. Natural Language Inference (NLI) is the task of determining whether one statement contradicts, is neutral toward, or entails another. The XLMR Base model was fine-tuned on a large dataset of 4 million translated texts. Training on a dataset of this size required several techniques, including data parallelism, gradient accumulation, gradient checkpointing, and NVIDIA mixed precision. Trained on 2 RTX 6000 GPUs on a cloud machine. Outperforms other models on key datasets.
-  2022.01 - 2023.12 Personal Stock Analytics. Uses an LSTM to analyze price changes, combined with sentiment analysis of data scraped from social media. The model predicts a confidence interval; when conditions are not predictable, the predicted range becomes very wide. Not a deployable model because of its lack of generalization and predictive features.
-  2022.01 - 2023.12 Instagram Scraper Module. A robust multiprocessing module for scraping massive amounts of Instagram data. Scrapes posts, profile pictures, following, and followers, and creates a graph representation of friendships. Focuses mainly on finding hidden APIs and bypassing any required rendering in order to achieve full speed.
Volunteer
-  2024.06 - Present Data Scientist (Data Science Initiative, Youth Catalyst Foundation). Working directly under the CTO to solve business problems by translating them into technical data objectives. Built a complete data infrastructure campaign using Mixpanel. Developed recommendation systems and a RAG chatbot for suggesting competitions, mentors, internships, scholarships, volunteer opportunities, and events.
-  2023.08 - Present Expert (Kaggle). Actively participated in discussions and published notebooks that garnered upvotes, earning a place in the top 3,300 out of 325,000 entries. Participated in 25 competitions, including a public playground competition in which I ranked 23rd out of 934.
-  2022.12 - Present Member (Ikatan Alumni Tim Olimpiade Komputer Indonesia). A prestigious organization of Indonesian olympiad winners that regularly forms task forces for the national olympiad and national training, and holds an annual meetup connecting Indonesia's top computer science talents.
-  2022.07 - 2022.11 Head of Organizers (RISTEK x Datacamp Scholarship). Managed the distribution of 50 Datacamp premium scholarships and ensured a high-quality learning experience.
-  2022.06 - 2022.11 Teaching Assistant (RISTEK Sister in Tech). Mentored women in tech and provided materials and project tasks to enhance their portfolios.
-  2022.04 - Present Lead of Data Science & Analytics (RISTEK). Led the Data Science & Analytics team, organized Indonesia’s first Ristek Datathon, and developed programs supporting AI transformation in Indonesia.
-  2022.02 - 2022.11 Expert Staff of Data Science Academy (COMPFEST). Developed a 12-day data science curriculum for 12 speakers during the bootcamp. Successfully invited the CEO of a leading AI company in Indonesia.
Languages
| Indonesian | Native Speaker |
| English | Fluent |