Idan Arbiv

Algorithm Engineer | GenAI, NLP, Deep Learning

Algorithm Engineer at WSC Sports specializing in Generative AI, NLP, and Deep Learning, building AI-driven services with publications in ICML and NeurIPS.

About Me.

Hey! 👋 I'm Idan Arbiv, an Algorithm Engineer at WSC Sports specializing in Generative AI, NLP, and Deep Learning. I recently completed my M.Sc. in Computer Science at Ben Gurion University, with published research in ICML and NeurIPS. Previously, I was a Teaching Assistant in programming and distributed systems, and served as a Communications Officer in the IDF, leading tech operations with a team of 40+. I’m passionate about building advanced AI-driven services and pushing the boundaries of deep learning innovation.

Technologies I've worked with:

React

Git

Linux

Android

Jira

MongoDB

NextJS

NumPy

WebSockets

Jquery

Download CV

Experience.

APR 2025 – PRESENT
GenAI & NLP Algorithm Engineer
WSC Sports
• Develop and research production-grade services using NLP and GenAI Algorithms
AUG 2023 – APR 2025
Algorithm Engineer - CTO Group
SONY Semiconductor
• Developed Web LLM-based applications using RAG architecture using Chroma, LangChain, and Streamlit. overseeing the entire process from design and architecture to development, testing, CI/CD using TeamCity, and deployment.
• Created a web-based internal search engine using the ELK Stack (Elastic Search, Kibana) and web scraping methods, managing all stages from initial design to production using Jira, Git, Bitbucket.
• Built a Python package for interactive plot visualization based on the Matplotlib package, enhancing data analysis capabilities.
• Developed a Python-based desktop application for chip power consumption analysis using Tkinter and Pandas.
• Improved the algorithmic 5G simulation code architecture, facilitating the development of new algorithms for chip design and ensuring robust CI/CD processes and production deployment.
OCT 2023 - OCT 2025
Teaching Assistant
Ben Gurion University
• CS 202.1.2051 - Principles Of Programming Languages (Spring 2024, 2025)
• CS 202.1.5391 - Distributed Systems Programming (Winter 2024, 2025)
MAY 2022 - AUG 2023
Software Engineer (Student)
SONY Semiconductor
• Developed testable, scalable, and maintainable code for new and existing software for cellular IoT applications, utilizing OOP, SOLID principles, and design patterns in desktop applications using Java.
• Developed new software functionalities using Java and Spring Boot framework on the server side, as well as databases management, leveraging my understanding of various algorithms.
• Worked on the client-side using React and Redux, creating dynamic and responsive UI components that enhanced user experience while employing analytical tools for optimal results.
NOV 2015 - AUG 2020
Communications (C4I) Officer
J6 & Cyber Defence Directorate, IDF
• Responsible for all communication and technology systems in a variety of combat and special units. Command of over 40 soldiers, 15 officers, 5 noncommissioned officers, demonstrating strong leadership while leading the units to success in a variety of operational missions in different arenas, including special missions abroad. Received excellence certificate for performance.

Education.

Ben-Gurion University

Master's Degree in Computer Science

Pursuing an M.Sc. with a 96 GPA, achieving Dean’s List honors. focusig on Deep Learning, Sequential Modeling, Gen-AI, and Representation Learning. Research under Dr. Omri Azencot, with publications in ICML and NeurIPS.

2023 - 2025

Ben-Gurion University

Bachelor's Degree in Computer Science

Graduated summa cum laude with a GPA of 93, achieving Dean’s List honors. Participated in the Dkalim Research Program for Excellent Students under the supervision of Dr. Omri Azencot, focusing on advanced computer science research and algorithms.

2020 - 2023

Deep LearningRepresentation LearningSequential ModelingGenerative ModelsNLPUnsupervised LearningComputer VisionDistributed SystemsData StructuresAlgorithmsImage processing

Publications.

Sequential Disentanglement by Extracting Static Information From A Single Sequence Element

One of the fundamental representation learning tasks is unsupervised sequential disentanglement, where latent codes of inputs are decomposed to a single static factor and a sequence of dynamic factors. To extract this latent information, existing methods condition the static and dynamic codes on the entire input sequence. Unfortunately, these models often suffer from information leakage, i.e., the dynamic vectors encode both static and dynamic information, or vice versa, leading to a non-disentangled representation. Attempts to alleviate this problem via reducing the dynamic dimension and auxiliary loss terms gain only partial success. Instead, we propose a novel and simple architecture that mitigates information leakage by offering a simple and effective subtraction inductive bias while conditioning on a single sample. Remarkably, the resulting variational framework is simpler in terms of required loss terms, hyper-parameters, and data augmentation. We evaluate our method on multiple data-modality benchmarks including general time series, video, and audio, and we show beyond state-of-the-art results on generation and prediction tasks in comparison to several strong baselines.

Authors: Ilan Naiman, Nimrod Berman, Idan Arbiv, Itai Pemper, Gal Fadlon, and Omri Azencot

Conference: NeurIPS, 2024

Diffusion ModelsGenerative ModelingSequential ModelingTime Series

GitHub PDF

Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time

Lately, there has been a surge in interest surrounding generative modeling of time series data. Most existing approaches are designed either to process short sequences or to handle long-range sequences. This dichotomy can be attributed to gradient issues with recurrent networks, computational costs associated with transformers, and limited expressiveness of state space models. Towards a unified generative model for varying-length time series, we propose in this work to transform sequences into images. By employing invertible transforms such as the delay embedding and the short-time Fourier transform, we unlock three main advantages: i) We can exploit advanced diffusion vision models; ii) We can remarkably process short- and long-range inputs within the same framework; and iii) We can harness recent and established tools proposed in the time series to image literature. We validate the effectiveness of our method through a comprehensive evaluation across multiple tasks, including unconditional generation, interpolation, and extrapolation. We show that our approach achieves consistently state-of-the-art results against strong baselines. In the unconditional generation tasks, we show remarkable mean improvements of 49.92% and 132.61% in the short discriminative and (ultra-)long classification scores, respectively.

Authors: Idan Arbiv, Gal Fadlon, Nimrod Berman, Ilan Naiman, and Omri Azencot

Conference: ICML, 2024

Representation LearningSequential DisentanglementSequential ModelingSequential Variational AutoencodersComputer Vision

GitHub PDF

Key Projects.

OCR In the Cloud

This project is a scalable cloud-based OCR system that processes images from URLs using Amazon Web Services (AWS). Users input a text file with image URLs, and the system dynamically allocates cloud resources to download, analyze, and extract text from the images. The result is presented in an HTML file with each image and its extracted text. The architecture includes three main components—a Local application, a Manager, and Workers—using AWS services like SQS and S3 for efficient task distribution, scalability, and persistence. The system is optimized to adjust resources based on workload, providing a cost-effective solution for large-scale OCR processing.

Cloud ComputingAWSOCRDistributed SystemsJava

GitHub

Real Time Connect Four

The Interactive Game project develops an engaging system that enables a human player to compete against a computer in Connect Four using a physical board. Utilizing a webcam and advanced computer vision techniques, the system captures the game state in real-time, accurately detecting the positions and colors of discs. It then employs strategic algorithms to calculate the computer next move and communicates these moves verbally, enhancing interactivity. The project demonstrates high accuracy in move detection and robust performance under varying conditions, successfully bridging the gap between traditional and digital gameplay. Future enhancements may include improved board detection and support for additional colors.

OpenCVImage ProcessingComputer Vision

GitHub

Maximum Weighted Increasing Subsequence

This project focuses on the Maximum-Weighted-Increasing-Subsequence problem, which involves managing the weights of points in a two-dimensional space while maintaining the maximum chain weight of increasing subsequences. The goal is to selectively increase the weights of specific points from 1 to 2 without exceeding the maximum chain weight, thereby preserving the integrity of potential chains formed from these points. By employing both naive and heuristic approaches, the project aims to develop effective algorithms that optimize point weights while exploring complex interactions between geometric and sequential properties.

Dynamic ProgrammingAlgorithm DesignData Structures

GitHub

Open Set Recognition With Contrastive Learning

This project focuses on Open Set Recognition (OSR) using a CNNs to identify known classes from the MNIST dataset while effectively flagging unseen classes as Unknown. By leveraging contrastive learning and decision boundaries in the latent space, the model distinguishes between in-distribution and out-of-distribution samples. The goal is to create a robust and adaptable AI system capable of recognizing new classes in real-world scenarios, enhancing its predictive capabilities beyond closed-set environments.

Computer VisionPyTorchContrastive LearningLatent Representation

GitHub

Hypernym Detection with Hadoop and OCR

Implements a method for automatic hypernym discovery. It utilizes an Amazon EMR cluster to process vast datasets of Google Syntactic N-grams, constructing dependency trees to identify shortest paths between nouns, which are then used to train classifiers with WEKA. The system architecture consists of a main class and two steps scheduled via EMR, allowing for efficient data handling and classification evaluation.

HadoopAWSNLPWEKA

GitHub

Evolutionary Algorithms Partition Problem

The Problem project addresses the NP-complete partition problem, which seeks to divide a set of numbers into two subsets with equal sums. Using a Genetic Algorithm (GA) and the EC-KITY library, the project not only implements a solution to the partition problem but also enhances the library with a live graph feature to visualize algorithm performance in real-time. The implementation involves generating a random array, evaluating individuals based on fitness, and utilizing tournament selection for better results in optimizing the partitioning process.

Evolutionary AlgorithmsEC-KITYPandas

GitHub

Keep in touch.

If you have any questions or just want to say hi, feel free to reach out to me.