About
I am an M.Sc. Informatics student at the Technical University of Munich, working on computer vision and 3D perception. My current research focuses on monocular and multi-modal 3D scene understanding, with an emphasis on geometric representation learning, segmentation, and vision foundation models for robust perception.
Before TUM, I worked as a machine-learning engineer at Aigorithm on computer-vision systems for object detection, segmentation, and active-learning data labeling. I also interned with the Zurich-based Google Shopping team, working from the Munich office on a machine-learning system prototype for offer freshness. These experiences shaped my interest in research that connects visual representation learning, 3D geometry, and reliable perception in real-world settings.
Research Interests
My interests center on robust scene understanding from visual and multi-modal data.
- Monocular and multi-modal 3D scene understanding for autonomous driving and robotics.
- Geometry-aware perception: 3D detection, segmentation, tracking, depth/shape estimation, and scene reconstruction.
- Vision and geometric foundation models: developing, adapting, and evaluating models for robust spatial perception.
- Weakly supervised, self-supervised, and scalable learning from images, masks, depth, and 3D structure.
Experience & Education
-
M.Sc. Informatics
Technical University of Munich
May 2022 - Dec 2026 expected
Expected graduation: Dec 2026. Focus: computer vision, machine learning, and 3D scene understanding.
-
Research & Teaching Assistant
Technical University of Munich
Dec 2022 - Apr 2025
Teaching assistant for Computer Vision 3: Detection, Segmentation and Tracking. Researched unsupervised semantic segmentation with denoising diffusion and student-teacher distillation.
-
Software Engineer Intern
Google Shopping, Zurich team; based in Munich
Aug 2022 - Dec 2022
Developed a machine-learning system prototype for estimating offer freshness from product and price signals.
-
Junior Machine Learning Engineer
Aigorithm, Cairo
Nov 2020 - Apr 2022
Developed computer-vision pipelines for data labeling, active learning, object detection, segmentation and model deployment.
-
Software Engineer Intern
DevisionX, Cairo
Jun 2018 - Aug 2018
Worked on ID verification using segmentation and recognition of Arabic text in TensorFlow, with Python/Flask backend integration.
-
B.Sc. Computer Science
Thebes University
Oct 2014 - Aug 2018
Final GPA: 3.44/4.00. Thesis: Arabic Image Captioning.
Projects
Selected Research Projects
-
M.Sc. Thesis / Current Research: Monocular and Multi-Modal 3D Scene Understanding
PyTorch 3D Perception Autonomous Driving
Researching foundation-model-based and geometry-aware approaches for 3D scene understanding in autonomous-driving scenarios, with emphasis on robust perception and reliable evaluation. -
Guided Research: Multi-Modal 3D Object Detection
[report]
LiDAR Image Fusion
Designed a LiDAR-image 3D detector with relation-aware reasoning for driving scenes and studied feature fusion strategies for 3D box localization. -
IDP: Semi-Supervised Vehicle Part Segmentation
[report | code]
Point Clouds Segmentation
Explored semi-supervised vehicle part segmentation in LiDAR point clouds to reduce annotation cost for autonomous-driving perception. -
Master-Praktikum: Unsupervised Video Segmentation
[report | code]
Video Optical Flow
Combined motion and appearance cues for unsupervised video object segmentation with optical-flow-based consistency losses. -
3DMM Estimation from Highly Distorted Images
[code | report]
3D Reconstruction
Estimated 3D morphable model parameters from highly distorted images and analyzed robustness under camera distortion. -
Semantic Segmentation with DGCNN on ScanNet
[code | report]
DGCNN ScanNet
Implemented point-cloud semantic segmentation on the ScanNet indoor dataset. -
Cobb Angle Estimation for Scoliosis Screening
[project | hackathon]
Medical Imaging
Built a deep-learning pipeline to estimate the Cobb angle from spine X-ray images for automatic scoliosis screening.
Additional Technical Projects
-
Road Damage Detection
PyTorch YOLOv5
Fine-tuned object-detection models and adapted training code for custom road-damage datasets. -
Human Action Recognition
Keras Video
Built video-classification models for human action recognition on UCF101. -
Arabic Image Captioning
Keras NLP
Prototyped Arabic image captioning using translated Flickr8k captions and Arabic word embeddings. -
Predict Future Sales
scikit-learn Time Series
Solved a Kaggle forecasting task with feature engineering, lag features, mean encoding, and ensemble models. -
Gender-Age Prediction
Keras Faces
Built a face-attribute prediction prototype using pretrained convolutional networks. -
Web and C++ Systems
[Q&A | store | courses | tic-tac-toe]
JSP MySQL C++
Built full-stack, backend, and algorithmic software prototypes during undergraduate and self-directed work.
Awards & Competitive Programming
- ICPC: Qualified for the ICPC World Finals through ACPC, with strong regional results including 6th place at ECPC and 2nd place at ECPCQ.
- Meta Hacker Cup: Reached Round 2 in 2019 and 2021; among the top contestants in Egypt.
- Contest judge: Served as ICPC contest judge at several Egypt local contests and regionals.
- Online certificates: Udacity Machine Learning Engineer Nanodegree scholarship; completed in 3 months vs. 6 expected. Also completed the Deep Learning Specialization by deeplearning.ai.
Certificates
- Udacity Machine Learning Engineer Nanodegree [certificate]
- Deep Learning Specialization by deeplearning.ai [certificate]
- Machine Learning Specialization by the University of Washington [certificate]