Hi! I am Sadman. I am completing my MS in Computer Science in Fall 2024 at the University of Wisconsin-Madison. I have a strong interest in developing deep learning methods to address modern challenges. My primary focus is on creating learning techniques for visual data (e.g., images, videos, 3D scenes, and synthesis), text data (e.g., corpora, tabular data, graphs, and closed-domain) and their various applications such as in healthcare, industrial automation, and intelligent assistants. In my recent projects, I have worked on multi-modal models, text analysis and generation, image analysis and generation, and state-space models.
During my MS, I also learned about Computer Systems, Networking, HPC and Data Systems. I interned at HPE in Summer 2023 and at Meta in Summer 2024, working on AI/HPC systems. Before coming to UW-Madison, I obtained my undergraduate major in Computer Science from Bangladesh University of Engineering and Technology and my undergraduate thesis was on Interpretable CNN .
In my free time, I can be found taking exercise, traveling places, reading non-fiction, cooking dishes and listening to music.
Masters of Science in Computer Sciences (2022 - 2024)
University of Wisconsin-Madison
Bachelor of Science in Computer Science (2017 - 2022)
Bangladesh University of Engineering and Technology
Course Title | Instructor | Year | Keywords | Projects |
---|---|---|---|---|
Computer Vision | Yin Li | 2024 | CNN, Image Classification, Object Detection, Semantic Segmentation, Image Generation, Human Pose Detection, Video Understanding, 3D Scene Understanding, Medical Image | Unsupervised Domain Adaptation for Semantic Segmentation |
Natural Language Processing | Junjie Hu | 2024 | Language Modeling, RNN, Transformers, Pre-training, Sequence Labeling, Document Modeling, RAG, Knowledge Graph, Dialogue System | Hybrid Model Architecture for Language Tasks |
Foundation Models | Fred Sala | 2024 | Transformers, SSM, In-context Learning, Chain-of-Thought, Specialization, Alignment, Data and Benchmarks, Multimodal, Scaling Laws | Mitigating Hallucination in Vision-Language Models |
Advanced Big Data Systems | Shivaram Venkataraman | 2024 | Big Data Stacks, Scheduling, Resource Management, ML Systems, Batch and Stream Analytics, Graph Processing | Memory Efficient Low-Rank Systems for Large Foundation Models |
Theoretical Foundations of Large-Scale ML | Dimitris Papailiopoulos | 2024 | Generalization, Stochastic Methods, Transformers, System Tradeoffs, Distributed Optimization, Federated Learning, Model Compression | Grokking - What/When/Why? |
Cloud-native Database | Xiangyao Yu | 2023 | Storage Disaggregation, Analytical and Transaction Processing, Serverless, Auto-Scaling, GPU DB, Memory Disaggregation, RDMA, NIC-assisted DB | Evaluation of Wasm for Computation Pushdown in Cloud Database |
Introduction to Big Data Systems | Tyler Caraza-Harter | 2023 | Distributed Filesystem, Distributed Database, Distributed ML, Partitioning, Fault Tolerance, Availability, Streaming | Distributed Data Processing with Hadoop, Spark and Cassandra |
High-Performance Computing | Dan Negrut | 2023 | GPU Architecture, GPU Memory, GPU Scheduling, ILP, Compiler Optimization, CUDA, Multi-core Programming, Supercomputer | Parallel Computing with CUDA, OpenMP and MPI |
SmartNIC Systems | Ming Liu | 2023 | FPGA NIC, SoC NIC, Application Offloading, Caching, Job Scheduling, Load Balancing | Flexible Block Storage Offload for Datacenters |
Advanced Computer Networks | Ming Liu | 2023 | Datacenter Architecture, Flow Scheduling, Routing, Load Balancing, Congestion Control, SDN, Endhost Network | DPDK Evaluation, Datacenter Congestion Control, Disaggregated Storage Profiling |
Operating Systems | Remzi Arpaci-Dusseau | 2022 | Process Management, Process Scheduling, Virtual Memory, Threads, Locking, File Systems, Logging, Storage | XV6, Parallel Sort, Distributed File System |
Introduction to Computer Architecture | Swamit Tannu | 2022 | Processor, Pipelining, Superscalar, MIPS ISA, SIMD, Cache, Memory Hierarchy | 5-stage Pipelined Processor for MIPS-like ISA |
Worked on InfiniBand fabric for AI/HPC clusters
Contributed to Linux kernel drivers for HPE Slingshot networking devices. Designed and implemented kernel-module API across multiple drivers.
Implemented a distributed hash join algorithm with linear-probing indexing utilizing remote shared memory. Evaluated the implementation on TPC-H benchmark and achieved performance close to Arrow’s native hashjoin.
Designed an array of interpretable CNN models to analyze sequence motifs of DNA bendability. Developed efficient algorithms for processing large-scale data.
Collaborated with Prof. Matthew Caesar and UIUC students to develop a virtual IoT platform for deployment and monitoring of IoT network. Participated in both development and leadership roles.
Designed network architecture for authentication and application mobility solutions in federated edge computing. Developed 3GPP-compliant mobility and roaming protocols for 4G/5G networks.