USRC Students and Postdocs

Tate Kernell,

PhD Student, Baylor University


Tate_Kernell@baylor.edu
Tate is a Ph.D. Candidate at Baylor University studying Riesz Preconditioners for the Rotating Shallow Water Equations on Finite Element Spaces. He received his B.S. Applied and Computational Mathematics and Statistics from the University of Notre Dame.

The objective of his project at the USRC is to gather/analyze a variety of data sets, which will be used in ensuring area specific information. The data collected will range from environmental sensors, network sensors, machine sensors, biomedical databases from the Internet, geo-spatial data sets, error datasets, resiliency data sets, HW component data sets, etc. Then once these are collected, the goal is to build various benchmarks and machine learning algorithms which will allow predictive analysis to assist in making informed choices on topics of interest.


Vanessa Job,

PhD Student, University of New Mexico


vjob@unm.edu
Vanessa Job is a second year Ph.D. student in computer science at the University of New Mexico. She is doing a second Ph.D. after having a long career as a professor of applied discrete mathematics. Her prior research was in combinatorial aspects of error-correcting codes. She is currently interested in applications of machine learning to high performance computing. At the USRC, she is working with Laura Monroe on probabilistic computing.

Wenbin Lu,

PhD Student, Stony Brook University


wenbin.lu@stonybrook.edu
Wenbin Lu is a PhD student from the applied math and statistics department of the Stony Brook University. He received his undergraduate degrees in mathematics and computer science from the University of Iowa. This summer at the USRC, Wenbin will be working on the modeling and simulation of mosquito-borne diseases like the dengue fever and the Zika. The software will be based on Hoover, which utilizes the PGAS programming model OpenSHMEM to reduce inter-node synchronization and improve scalability.

Atanu Barai,

Masters Student, New Mexico State University


atanu@nmsu.edu
Atanu Barai is a former USRC student. At the USRC, he worked on the Performance Prediction Toolkit, which is a discrete-event-simulation based parameterized hardware software co-design framework.

Yehia Arafa,

Masters Student, Klipsch School of Electrical and Computer Engineering


yarafa@nmsu.edu
Yehia Arafa is a former USRC student. At USRC, Yehia worked on performance prediction of super computers through contributing and validating the hardware model of the open source tool developed at Los Alamos National Lab (LANL); Performance Prediction Toolkit (PPT) and improving its prediction accuracy.

Michael Sevilla,

PhD Student, University of California, Santa Cruz


msevilla@ucsc.edu
Michael is a former USRC student. Michael evaluated and designed distributed file system metadata management systems. His publications prototype ideas on CephFS, the file system that uses the Ceph distributed object store. His lab also has a special interest in storage system programmability and reproducibility in systems research. At USRC, Michael is working with Brad Settlemyer on load balancing policies for HXHIM, an HPC key-value store.

Alexandra DeLucia,

Undergraduate Student, Rollins College


aadelucia@gmail.com
Alexandra is a former USRC student. She worked on creating a model of system logs from high performance computers. The model will later be used in anomaly detection.

Michael Lane,

Post Bachelor, Los Alamos National Laboratory


mlane@lanl.gov
Michael Lane is a former USRC student. He researched techniques to make exact computations on fault-prone hardware.

Kai Wu,

PhD Student, University of California, Merced


Visit Homepage
kwu42@ucmerced.edu
Kai is a former USRC student. His research broadly falls into general areas of High Performance Computing (Large-Scale Parallel Systems). Specifically, he focuses on the following areas:(i) Parallel programming models and runtime; (ii) Performance optimization and modeling; (iii) Resilience and Consistency; (iv)Non-volatile memory; (v) Fault Tolerance in Extreme-Scale Parallel Systems. At USRC, Kai is working on building faults model on serial codes and predicting the faults on parallel codes.

Olena Tkachenko,

Post Bachelor, Los Alamos National Laboratory


otkac001@fiu.edu
Olena is a former USRC student. At the USRC she is worked on predicting DRAM fault locations in HPC systems using structured learning and various ML techniques. Her research interests include HPC, ML, and fault prediction/mitigation.

Jian Peng,

PhD Student, Computer Science, Illinois Institute of Technology


jpeng10@hawk.iit.edu
Jian is a former USRC student. He worked on Burst Buffer simulation in Dragonfly network. The goal was to develop a simulator which models supercomputers with dragonfly network and burst buffer storage architecture.

Nathaniel Graham,

Graduate Student, Computer Science, UNM


ngraham@lanl.gov
Nathaniel Graham is a former USRC student. He worked on networking software. He provided bug fixes, documentation, and enhancements for Open MPI, as well as some work with UCX. He was also the primary maintainer of the Open MPI Java bindings.

Mitchell Klein,

Student, Los Alamos National Laboratory


mjklein@lanl.gov
​Mitchell Klein is a former USRC student. He worked on a project that tested the resilience of algorithms.

Emily Vecchia,

Undergraduate Student, University of St. Thomas


evecchia@lanl.gov
Emily Vecchia is a former USRC student. She worked on a probabilistic computing project testing the resilience of algorithms to faults.

Ryan Slechta,

Graduate Student, Ohio State University


rslechta@lanl.gov
Ryan Slechta is a fromer USRC student. He received his Bachelors degree in Mathematics and Computer Science from the University of St. Thomas in May 2016. He worked with NMC staff and scientists on problems of algorithmic resilience, and to improve the reliability of erasure code techniques.

Rusty H Davis,

Graduate Student, Clemson University


zopppo@gmail.com
Rusty is a fromer USRC student. He graduated with his B.S. in computer science​ from the School of Computing at Clemson University in May 2016. He will begin pursuing his masters of Computer Science at Clemson University in Fall 2016. Rusty's initial work was with Dr. Nathan DeBardeleben and Dr. William Jones concerning Algorithmic-Based Fault Tolerant Matrix Multiplication. He also focused on ​quantifying the resiliency of​ Algorithmic-Based Fault Tolerant Fast Fourier Transforms and creating an interface for the F-SEFI fault injector. His research interests include High Performance Computing, Operating Systems, and Resilience/Fault tolerance.

Scott Lavigne,

Graduate Student, Ohio State University


lavigne@lanl.gov
Scott is a former USRC student. While obtaining his BS in Computer Science with a minor in Applied Mathematics from Coastal Carolina University, Scott worked on various projects for the USRC, ranging from fault injection studies with F-SEFI to analyzing ECC of interest to the team. Scot went on to pursue a PhD at Ohio State.

Adam P Morrow,

Undergraduate Student, Brigham Young University


Adam is a former USRC student. He was an undergraduate intern at the New Mexico Consortium where he worked on mapping error clusters to originating faults and errors in DRAM units on leadership-class supercomputing machines.

Dr. Li Tan,

Postdoctoral Researcher, Los Alamos National Laboratory


Li Tan is a former USRC student. He graduated with a Ph.D. degree in Computer Science from University of California, Riverside (UCR) in 2015. His chief research interest is High Performance Computing (HPC), in particular improving resilience/reliability and energy/power efficiency for high performance scientific algorithms and applications, and software debugging in large-scale HPC environments. At USRC, he worked in fine-grained resilience and low-power modeling and provisioning for HPC applications, using fault injection and near-threshold voltage reduction techniques. He served as a reviewer for prestigious conferences and journals on high performance parallel and distributed computing, such as SC, IPDPS, PACT, CCGrid, IEEE TPDS, IJHPCA, and JSS. He is a recipient of Dean's Distinguished Fellowship from UCR in 2010. He is a Member of the IEEE and a Member of the ACM.

Jason Lee,

Master's Student, Computer Science, Florida State University


Jason Lee is a former USRC student. He was a master's student at Florida State University and received a B.S. from Rensselaer Polytechnic Institute. His interests include cryptography, parallel programming, and networking.

At the USRC he worked on software defined networking with Infiniband for high performance computing systems.


Song Huang,

Ph.D. Student, Department of Computer Science and Engineering, University of North Texas


SongHuang@my.unt.edu
Song Huang is a former USRC stduent. He was Ph.D. student in the Department of Computer Science and Engineering at the University of North Texas. He worked in the Dependable Computing Systems Lab directed by Dr. Song Fu. His research interests include power and energy consumption on the HPC system, disk failure modeling and analysis, resilience and fault tolerance techniques on the HPC system.

He worked at the USRC on characterizing the power consumption on the Haswell machines and resource allocation and scheduling on the HPC system.


Panruo Wu,

PhD Student, Computer Science & Engineering, University of California Riverside


pwu011@ucr.edu
Panruo Wu is a former USRC student. He got his bachelor’s degree in mathematics from University of Science and Technology of China in 2011. He went on to be a PhD candidate in UCR. His research interests include fault tolerance in parallel and distributed systems and numerical algorithms.

At the USRC he worked on F-SEFI fault injector and developing highly fault tolerant algorithms that can run correctly and efficiently in the presence of numerous architectural faults.


Nicholas Nelson,

Graduate Student, New Mexico State University


njnelson@nmsu.edu
Nicholas is a former USRC student. He received his bachelor's degree in micro-biology from Cornell University in 2010. After working in Flow Cytometry and Scanning Electron Microscopy he entered the computer science graduate program at New Mexico State University. Nicholas is researching parallel programing in high performance computing (HPC). At the USRC, Nicholas worked on benchmarking OpenSHMEM in contrast to MPI.

Zhou Tong,

PhD Student, Computer Science, Florida State University


tong@cs.fsu.edu
Zhou is a former USRC student who was a Ph.D. student in the Department of Computer Science at Florida State University, and he received his B.S. from Millsaps College. His research includes interconnection network, parallel application performance modeling and data mining. At the USRC, Zhou worked on performance evaluation and communication modeling on MPI applications to provide fast classification assessment based on the understanding of the characteristics of various parallel applications in production HPC systems.

Michael Carlton,

Undergraduate Student, University of Kentucky


mcarlton93@gmail.com
Michael Carlton is a former USRC student. He was an undergraduate with a dual major in Computer Engineering and Computer Science at the University of Kentucky. This summer at LANL he worked with his mentors, Nathan DeBardeleben and Sean Blanchard, to develop a new method for handling memory hardware errors. This project was a success and will hopefully be utilized in the near future on production machines.

Lewis Tseng,

PhD Student, Computer Science, University of Illinois at Urbana-Champaign


ltseng3@uiuc.edu
Lewis Tseng is a former USRC student. Lewis designed and developed a distributed key-value store to support service registration and lookup service for HPC systems. The main goals of the new key-value store are: (i) autonomous system management, (ii) configurable per-key consistency and replication at the data granularity, (iii) flexible and configurable deployment based on different policies for power, performance and resilience.

Zhenjie Chen,

PhD Student, Computer Science, University of New Mexico


Visit Homepage
zhenjie@cs.unm.edu
505 615-9186 (o)
Zhenjie is a former USRC student. At the USRC, Zhenjie worked on integrating Scalable Information Propagation service into LIBI(Lightweight Infrastructure-Bootstrapping Infrastructure), in order to improve the performance of bootstrapping numerous processes especially the wire-up procedure. 

Atiqul Mollah,

PhD Student, Florida State University


mollah@cs.fsu.edu
Atiqul is a former USRC student. At the USRC, he worked on implementation of dynamic and scalable services in HPC systems using OpenFlow/SDN.

Bin Huang,

PhD Student in Reconfigurable Computing System Lab, University of North Carolina at Charlotte


bin.arthur@gmail.com
Bin is a former USRC student. At the USRC, Bin conducted research on a resilient runtime system framework for heterogeneous many-core architecture which combines a few reliable cores and many less reliable cores.

Chao Chen,

PhD Student, Texas Tech University


chao.chen@ttu.edu
Chao Chen is a former USRC student. At the USRC he worked on developing and evaluating a new paradigm for Data-intensive applications to reduce the impact of I/O limitations.


Page 2 of 3

© 2018 New Mexico Consortium