USRC Students and Postdocs

This page lists all current and past students who have worked at the USRC.

Qing Zhen,

Graduate Student, Carnegie Mellon University


zhengq@cs.cmu.edu
Qing is a 4th-year Ph.D. student at Carnegie Mellon University Computer Science Department. At Carnegie Mellon, Qing works with Professor Garth Gibson, researchers at the Carnegie Mellon Parallel Data Lab, and scientists at Los Alamos National Lab (LANL), on file system metadata designs (IndexFS and DeltaFS) for massive-scale science applications. Their IndexFS paper has won Best Paper Award at the Supercomputing Conference (SC) 2014. At NMC Ultra System Research Center (USRC), Qing works with Brad Settlemyer and other USRC and LANL scientists on VPIC and DeltaFS integration, and high-performance metadata implementation and demonstration.

Ziming Zhang,

PhD Student, Computer Science & Engineering, University of N Texas


Visit Homepage
zimingzhang@my.unt.edu
Ziming Zhang obtained his bachelor's degree in computer science and engineering from Beihang University in China 2009. He began to pursue his doctorate at New Mexico Tech in 2009 and worked as research assistant under Dr.Song Fu's supervision. In 2010, he transferred to the University of North Texas to continue his education. His research interest is primarily in power management and system dependability in virtualized cloud computing environment.

At the USRC, Zhang is working on power-aware task scheduling and placement techniques in HPC system.


Panruo Wu,

PhD Student, Computer Science & Engineering, University of California Riverside


pwu011@ucr.edu
Panruo Wu got his bachelor’s degree in mathematics from University of Science and Technology of China in 2011. He is currently a PhD candidate in UCR. His research interests include fault tolerance in parallel and distributed systems and numerical algorithms. At the USRC he works on F-SEFI fault injector and developing highly fault tolerant algorithms that can run correctly and efficiently in the presence of numerous architectural faults.

Kai Wu,

PhD Student, University of California, Merced


Visit Homepage
kwu42@ucmerced.edu
Kai currently a Computer Science PhD student in EECS at University of California, Merced. Before coming to UC Merced, he got his Masters degree in Computer Science and Engineering from Michigan State University in 2016. His research broadly falls into general areas of High Performance Computing (Large-Scale Parallel Systems). Specifically, he focuses on the following areas:(i) Parallel programming models and runtime; (ii) Performance optimization and modeling; (iii) Resilience and Consistency; (iv)Non-volatile memory; (v) Fault Tolerance in Extreme-Scale Parallel Systems. At USRC, Kai is working on building faults model on serial codes and predicting the faults on parallel codes.

Dr. Sean Williams,

Postdoctoral Researcher, New Mexico Consortium


swilliams@newmexicoconsortium.org
I'm working to create a unified abstraction for complex memory systems, along with a suite of libraries to automate the placement of allocations on memory devices.

Ke Wang,

PhD Student Computer Science, Illinois Institute of Technolodgy


Visit Homepage
kewangiit@gmail.com
I am now a second-year Ph.D candidate of the Department of Computer Sciecne (CS) at Illinois Institute of Technology (IIT). I am a member of the Data-Intensive Distributed Systems Laboratory (DataSys) at IIT, and working with Professor Ioan Raicu. My research work and interests are in the general area of distributed systems.

At USRC I am working on simulations to build general framework for distributed system services, such as key-value store, job launch, job scheduling, and distributed file system, with up to exascales (millions of nodes). Our simulator supports different architectures, centralized server, centralized server with multiple aggregation servers, distributed servers with fully connected topology, and distributed servers with chord protocol. Also, we implemented different models, churn model, replication model, replication model and consistency model. Our goal is to understand with all these models, to a certain, which architecture makes sense for system services.


Emily Vecchia,

Undergraduate Student, University of St. Thomas


evecchia@lanl.gov
Emily Vecchia is an undergraduate student majoring in Mathematics with a minor in Computer and Information Sciences at the University of Saint Thomas in Saint Paul, Minnesota. This summer, she is working with her mentor, Laura Monroe, on a probabilistic computing project testing the resilience of algorithms to faults. In the fall, Emily will return to St. Thomas where she will be a senior.

Lewis Tseng,

PhD Student, Computer Science, University of Illinois at Urbana-Champaign


ltseng3@uiuc.edu
Lewis Tseng is a PhD student in the department of Computer Science at the University of Illinois at Urbana-Champaign. His research interests include fault-tolerant distributed algorithms and systems. In particular, he mainly focus on related issues of consensus or consistency in theoretical models that capture the characteristics of large-scale systems in both wired and wireless network. At USRC, Lewis designed and developed a distributed key-value store to support service registration and lookup service for HPC systems. The main goals of the new key-value store are: (i) autonomous system management, (ii) configurable per-key consistency and replication at the data granularity, (iii) flexible and configurable deployment based on different policies for power, performance and resilience.

Zhou Tong,

PhD Student, Computer Science, Florida State University


tong@cs.fsu.edu
Zhou is a Ph.D. student in the Department of Computer Science at Florida State University, and he received his B.S. from Millsaps College. His research includes interconnection network, parallel application performance modeling and data mining. At the USRC, Zhou worked on performance evaluation and communication modeling on MPI applications to provide fast classification assessment based on the understanding of the characteristics of various parallel applications in production HPC systems.

Olena Tkachenko,

Post Bachelor, Los Alamos National Laboratory


otkac001@fiu.edu
Olena graduated with a B.S. in Computer Science from FIU's School of Computing and Information Sciences in Miami. At FIU she did research at the VISA lab as a URA working on masquerading network traffic for Mission Critical Cloud Computing, and isolation benchmarking of containers. While working at LANL as a PostBac she designed an application model (IMCSim) of the implicit Monte Carlo particle code IMC using the Performance Prediction Toolkit (PPT), a discrete-event simulation-based modeling framework for predicting code performance on a large range of parallel platforms. At USRC she is currently working on predicting DRAM fault locations in HPC systems using structured learning and various ML techniques. Her research interests include HPC, ML, and fault prediction/mitigation.

Dr. Li Tan,

Postdoctoral Researcher, Los Alamos National Laboratory


Li Tan graduated with a Ph.D. degree in Computer Science from University of California, Riverside (UCR) in 2015. His chief research interest is High Performance Computing (HPC), in particular improving resilience/reliability and energy/power efficiency for high performance scientific algorithms and applications, and software debugging in large-scale HPC environments. At USRC, he works in fine-grained resilience and low-power modeling and provisioning for HPC applications, using fault injection and near-threshold voltage reduction techniques. He served as a reviewer for prestigious conferences and journals on high performance parallel and distributed computing, such as SC, IPDPS, PACT, CCGrid, IEEE TPDS, IJHPCA, and JSS. He is a recipient of Dean's Distinguished Fellowship from UCR in 2010. He is a Member of the IEEE and a Member of the ACM.

Philip Soltero,

Graduate Computer Science Dept, University of New Mexico


psoltero@cs.unm.edu
Philip is an ex-game developer turned computer science student. In Spring 2005 he was accepted into the computer science department at the University of New Mexico and in Summer 2005, he joined the UNM CS department's Scalable Systems Lab. He graduated with his BS in CS in Spring 2011 and joined the USCR in Summer 2011. His research interests span the realms of virtualization and supercomputing. On his free time he does autocross on his FC RX-7 - an older, Japanese sports car that is turning out to be a continuous project and a lesson in patience.

At USRC. 
Philip worked on fault tolerant, scalable system services for ultra-scale computing.

Homepage


Ryan Slechta,

Graduate Student, Ohio State University


rslechta@lanl.gov
Ryan Slechta received his Bachelors degree in Mathematics and Computer Science from the University of St. Thomas in May 2016. He works with NMC staff and scientists on problems of algorithmic resilience, and is currently working to improve the reliability of erasure code techniques. In the fall, he will be joining the Topology, Geometry, and Data Analysis group at The Ohio State University.

Michael Sevilla,

PhD Student, University of California, Santa Cruz


msevilla@ucsc.edu
Michael evaluates and designs distributed file system metadata management systems. His publications prototype ideas on CephFS, the file system that uses the Ceph distributed object store. His lab also has a special interest in storage system programmability and reproducibility in systems research. At USRC, Michael is working with Brad Settlemyer on load balancing policies for HXHIM, an HPC key-value store.

Jian Peng,

PhD Student, Computer Science, Illinois Institute of Technology


jpeng10@hawk.iit.edu
I'm a second-year Phd student majoring in Computer Science at Illinois Institute of Technology(IIT), advised by Dr. Ioan Raicu. My research interest is HPC.

Currently at USRC I'm working on Burst Buffer simulation in Dragonfly network. The goal is to develop a simulator which models supercomputers with dragonfly network and burst buffer storage architecture. With such a simulator, we will be able to carry out more research on problems such as system bottleneck and burst buffer related scheduling.


Doug Otstott,

PhD Student, School of Computing & Information Sciences, Florida International University


Visit Homepage
dotst001@fiu.edu
Doug is a PhD student from Florida International University's School of Computing and Information Sciences in Miami.  It was here that he began his graduate studies after graduating in the Summer of 2011 and receiving the CS program's award for Outstanding Graduate.

While in Miami, Doug works at lab for Virtualized Infrastructure, Software and Applications at FIU doing research on caching for large scale systems and scheduling optimizations for SSDs.

During his time at USRC, Doug will be working to extend a preexisting project "Transparently Consistent Asynchronous Shared Memory" by exploring applications in check pointing and in-situ data analysis.


Nicholas Nelson,

Graduate Student, New Mexico State University


njnelson@nmsu.edu
Nicholas received his bachelor's degree in micro-biology from Cornell University in 2010. After working in Flow Cytometry and Scanning Electron Microscopy he entered the computer science graduate program at New Mexico State University. Nicholas is researching parallel programing in high performance computing (HPC). At the USRC, Nicholas worked on benchmarking OpenSHMEM in contrast to MPI.

Adam P Morrow,

Undergraduate Student, Brigham Young University


Adam is an undergraduate intern at the New Mexico Consortium where he is working on mapping error clusters to originating faults and errors in DRAM units on leadership-class supercomputing machines. He is pursuing a B.S. degree at Brigham Young University in Applied Computational Mathematics and Computer Science.

Atiqul Mollah,

PhD Student, Florida State University


mollah@cs.fsu.edu
Atiqul received his Bachelor’s degree from Bangladesh University of Engineering and Technology. Currently he is a PhD student at Florida State University. His area of research interest include Software Defined Networking(SDN), Interconnection Networks, Data center networks and Parallel Processing using MPI.

At USRC, he is working on implementation of dynamic and scalable services in HPC systems using OpenFlow/SDN.


Kshitij Mehta,

PhD Student Computer Science, University of Houston


Visit Homepage
kvm1983@gmail.com
Kshitij is a high performance computing (HPC) researcher and programmer. He works on developing Parallel I/O interfaces for OpenMP and shared memory machines as part of his Phd. Additionally, he works on performance modeling of parallel I/O operations. His specialties are C, Posix Threads, OpenMP, Parallel I/O, MPI, and modeling.

At the USRC
 Kshitij worked on integrating HDF5 with PLFS, writing a plugin using HDF5's Virtual Object Layer (VOL) that uses LANL's PLFS.


Santosh Mahapatra,

PhD Student, Computer Science, Florida State University


Visit Homepage
santosh.kumar.mahapatra@gmail.com
I am a PhD candidate in the Computer Science department at Florida State University (FSU), Tallahassee, Florida, US. My areas of interest include general areas of Computer Networks, Interconnection Networks, Parallel Architectures, Storage Area Networks, Data Centers, High Performance Computing (HPC) clusters. His current area of research is routing and load-balancing in high performance computing (HPC) clusters and the Data Center networks.

Santosh worked with the USRC Systems group on Interconnection networks evaluating different topologies across various routing schemes to identify the best networking paradigms for the next generation super-computers.


Jason Lee,

Master's Student, Computer Science, Florida State University


Jason Lee is a master's student at Florida State University and received a B.S. from Rensselaer Polytechnic Institute. His interests include cryptography, parallel programming, and networking.

At the USRC he is working on software defined networking with Infiniband for high performance computing systems.


Scott Lavigne,

Graduate Student, Ohio State University


lavigne@lanl.gov
Scott is a graduate student studying network errors on LANL's Trinity supercomputer. While obtaining his BS in Computer Science with a minor in Applied Mathematics from Coastal Carolina University, Scott worked on various projects for the USRC, ranging from fault injection studies with F-SEFI to analyzing ECC of interest to the team. In the fall, Scott will begin the direct PhD track at The Ohio State University.

Michael Lane,

Post Bachelor, Los Alamos National Laboratory


mlane@lanl.gov
Michael Lane is researching techniques to make exact computations on fault-prone hardware.

Mitchell Klein,

Student, Los Alamos National Laboratory


mjklein@lanl.gov
​Mitchell Klein is a summer intern at Los Alamos where he is working on a project that tests the resilience of algorithms. He recently earned his B.A. in Applied Mathematics from the University of St. Thomas in St. Paul, Minnesota. In the future, Mitchell plans to pursue graduate studies in mathematics or a related field.

Stevenson Jian,

PhD Student Electrical & Computer Engineering, U of Illinois at Urbana Champaign


xunjian1@illinois.edu
Stevenson is a PHD student in the department of Electrical and Computer Engineering, at the University of Illinois at Urbana Champaign. His research interest has been on reducing the power and storage overhead of error resilient memories.

Stevenson was at USRC
 to study the reliability of different strengths of chipkill correct memories under different faulty DIMM replacement policies.


Song Huang,

Ph.D. Student, Department of Computer Science and Engineering, University of North Texas


SongHuang@my.unt.edu
Song Huang is a Ph.D. student in the Department of Computer Science and Engineering at the University of North Texas. He works in the Dependable Computing Systems Lab directed by Dr. Song Fu. His research interests include power and energy consumption on the HPC system, disk failure modeling and analysis, resilience and fault tolerance techniques on the HPC system. Currently, he works at USRC on characterizing the power consumption on the Haswell machines and resource allocation and scheduling on the HPC system.

Bin Huang,

PhD Student in Reconfigurable Computing System Lab, University of North Carolina at Charlotte


bin.arthur@gmail.com
Bin is a PhD student in Reconfigurable Computing System Lab at the University of North Carolina at Charlotte. His research interest includes novel computer architecture using FPGA technology and resilient many-core chip.

At USRC, Bin conducted research on a resilient runtime system framework for heterogeneous many-core architecture which combines a few reliable cores and many less reliable cores.


Jun He,

PhD Student Computer Science, Illinois Institute of Technolodgy


Visit Homepage
junhe@gmail.com
I am a Ph.D. student at the Department of Computer Science, Illinois Institute of Technology. I received my B.E. and M.S. from Hunan University, China. Currently, my research interests are parallel file systems, I/O analysis and optimization. My homepage is http://mypages.iit.edu/~jhe24/.

For my work at USRC, I am working on developing and evaluating algorithms by which patterns in the PLFS (Parallel Log-structured File System) metadata can be discovered and then used to replace the current metadata, in order to reduce metadata size


Abida Haque,

PhD Student, Computer Science, North Carolina State University


haqueabida@gmail.com
Abida will be a PhD student at North Carolina State University in computer science. She has a bachelor's degree in mathematics from Carnegie Mellon University and a master's degree in computer science from Georgia Tech.

During her time at USRC, Abida will help with the project Latent Anomaly Detection for Supercomputing System Performance.



Page 1 of 2

© 2017 New Mexico Consortium