About
Husam leads the Systems Technology Team at Microsoft’s CO+I CTO Office focusing on…
Experience
Education
-
Binghamton University
Dissertation: Chip to Chiller Energy-Reliability Aware Empirical and Numerical Analyses of the Integrated Data Center.
-
Thesis: Numerical Simulation of Natural Convection Particulate Flow in Microchannel.
-
Volunteer Experience
Publications
-
Chip to Chiller Experimental Cooling Failure Analysis of Data Centers: The Interaction Between IT and Facility
IEEE TRANSACTIONS ON COMPONENTS, PACKAGING AND MANUFACTURING TECHNOLOGY
Cooling failure in data centers (DCs) is a complex phenomenon due to the many interactions between the cooling infrastructure and the information technology equipment (IT). To fully understand it, a system integration philosophy is vital to the testing and design of experiment. In this paper, a facility-level DC cooling failure experiment is run and analyzed. An airside cooling failure is introduced to the facility during two different cooling set points as well as in open and contained…
Cooling failure in data centers (DCs) is a complex phenomenon due to the many interactions between the cooling infrastructure and the information technology equipment (IT). To fully understand it, a system integration philosophy is vital to the testing and design of experiment. In this paper, a facility-level DC cooling failure experiment is run and analyzed. An airside cooling failure is introduced to the facility during two different cooling set points as well as in open and contained environments. Quantitative instrumentation includes pressure differentials, tile airflow, external contour and discrete air inlet temperature, intelligent platform management interface (IPMI), and cooling system data during failure recovery. Qualitative measurements include infrared imaging and airflow visualization via smoke trace. To our knowledge of current literature, this is the first experimental study in which an actual multi-aisle facility cooling failure is run with real IT (compute, network, and storage) load in the white space. This will establish a link between variations from the facility to the central processing unit (CPU). The results show that using the external IT inlet temperature sensors, the containment configuration shows a longer available uptime (AU) during failure. However, the IPMI data show the opposite. In fact, the available uptime is reduced significantly when the external sensors are compared to internal IT analytics. The response of the IT power, CPU temperature, and fan speed shows higher values during the containment failure. This occurs because of the instantaneous formation of external impedances in the containment during failure, which renders the contained aisle to be less resilient than the open aisle. The tradeoffs between PUE, OPEX, and AU are also explained.
Other authorsSee publication -
Analysis of Airflow Imbalances in an Open Compute High Density Storage Data Center
Applied Thermal Engineering
This article reports experimental and numerical testing performed to characterize the operation and reliability of the open compute (OC) storage system in contained environment from server to aisle levels. The study is comprised of three parts. The first part is an experimental analysis of the high density (HD) 3D array storage unit thermal and utilization responses during airflow imbalances. This is done with the stress test proposed for IT in containment to mimic possible mismatch and cascade…
This article reports experimental and numerical testing performed to characterize the operation and reliability of the open compute (OC) storage system in contained environment from server to aisle levels. The study is comprised of three parts. The first part is an experimental analysis of the high density (HD) 3D array storage unit thermal and utilization responses during airflow imbalances. This is done with the stress test proposed for IT in containment to mimic possible mismatch and cascade failure scenarios. It is found that downstream HDDs are most prone to overheating and loss in utilization during an airflow imbalance. This was proven to undermine the storage capacity of the hard disk drives. An IT level airflow prediction model is discussed for the storage unit and validated for different fan speeds. In the second part, a computational fluid dynamics model is created for a high density open rack based on the active flow curve method. Here, the measured airflow response curves for the open compute IT(storage and compute servers) are used to build compact models and run rack level testing for IT air systems sensitivity and create a rack level AFC (active flow curve) airflow demand prediction model. Finally, the experimental characterization data is used to build an aisle level model (POD) that incorporates IT fan control systems (FCS). This modeling approach yields shorter uptime during chiller failure due to increased recirculation induced by increased IT airflow demand during cases such as chiller failure or high economizer temperatures.
-
Chip to Facility Ramifications of Containment Solution on IT Airflow and Uptime
IEEE Transactions on Components, Packaging and Manufacturing Technology
We developed a generalized testing methodology in which IT fans' internal and external impedance effects are collapsed into a single curve (i.e., flow curve). Such curves can give accurate flow rate predictions of the IT equipment based on RPM logging. Flow curves cover the three possible airflow regions for IT in a contained environment. The experimental procedure is introduced to rank IT equipment based on its air systems, and the flow regions are correlated with the CPU performance. The…
We developed a generalized testing methodology in which IT fans' internal and external impedance effects are collapsed into a single curve (i.e., flow curve). Such curves can give accurate flow rate predictions of the IT equipment based on RPM logging. Flow curves cover the three possible airflow regions for IT in a contained environment. The experimental procedure is introduced to rank IT equipment based on its air systems, and the flow regions are correlated with the CPU performance. The strength of the IT air system is characterized by the free delivery and the critical pressure points. In addition, the experimentally obtained active flow curves can be used as a numerical boundary condition in various containment designs. Accordingly, for the first time in the literature, the aerodynamic interaction of IT with different air systems in the containment (i.e., BladeCenters, servers, and switches) can numerically be investigated. It is found that stacking IT in a contained environment with no physical consideration can lead to radical reliability problems. The region of reverse flow is discovered to be very possible upon events in the data center. In such a case, the weak IT (e.g., 1U switch) behaves as an open leakage into the sealed containment. On the other hand, at the critical pressure point, minimum forced convection occurs in the weak IT, leading to CPU overheating.
Other authorsSee publication -
Performance of Temperature Controlled Perimeter and Row-Based Cooling Systems in Open and Containment Environment
Proceedings of the ASME 2015 International Mechanical Engineering Congress and Exposition IMECE2015, At Houston, Texas, USA
In a previous study, the interaction of perimeter and row based cooling were contrasted in a CAC environment, however, emphasis is needed still on how each system behaves in entrainment conditions due leakage. Also, benefits of containing a row based cooled aisle has not been compared with the aisle in open conditions. Sets of IT that differs in air system strength are used as load to be cooled by both cooling systems. Imperially measured flow curves of IT equipment are employed to simplified…
In a previous study, the interaction of perimeter and row based cooling were contrasted in a CAC environment, however, emphasis is needed still on how each system behaves in entrainment conditions due leakage. Also, benefits of containing a row based cooled aisle has not been compared with the aisle in open conditions. Sets of IT that differs in air system strength are used as load to be cooled by both cooling systems. Imperially measured flow curves of IT equipment are employed to simplified IT equipment model in the CFD code. This study numerically investigates the performance of in-row coolers in both opened and cold contained aisles. The steady state analysis includes manufacturer’s specifications such as heat exchanger performance. The CFD model was validated using a new data center laboratory at Binghamton University with perimeter cooling. A Libert® CW114 is the sole cooling unit in the data center. In this study, a sliced part of the data center is studied in order to generalize the results and reduce the computational time. 16 racks are distributed on two sides of the cold aisle that contains 2 power distribution units and built in full details in the CFD code. This study elucidates on the particular behavior of airflow in the cold aisle when row-based cooling is utilized. This is mainly attributed to the change in the location of the airflow source. Also, correlations are derived to predict the ability of ACUs to maintain set points at different air flows.
Other authors -
RANKING AND OPTIMIZATION OF CAC AND HAC LEAKAGE USING PRESSURE CONTROLLED MODELS
Proceedings of the ASME 2015 International Mechanical Engineering Congress and Exposition IMECE2015, At Houston, Texas, USA
In cold aisle containment (CAC) the supply of cold air is separated within the contained volume. The hot air exhaust leaves the IT and increases the room’s temperature before returning to the cooling unit. On the other hand, hot aisle containment (HAC) generates a cooler environment in the data center room as a whole by segregating hot air within the containment. Hot air is routed back to the cooling unit return by a drop ceiling or a chimney. Each system has different characteristics and…
In cold aisle containment (CAC) the supply of cold air is separated within the contained volume. The hot air exhaust leaves the IT and increases the room’s temperature before returning to the cooling unit. On the other hand, hot aisle containment (HAC) generates a cooler environment in the data center room as a whole by segregating hot air within the containment. Hot air is routed back to the cooling unit return by a drop ceiling or a chimney. Each system has different characteristics and airflow paths. For instance, leakage introduces different effects for CACs and HACs since the hot and cold aisles are switched.
This article utilizes data center measurements and containment characterization carried out circa April 2015 in the ES2 Data Center lab at Binghamton University. Details on the containment model include leakages at below racks, above racks, below CAC doors, between doors, and above doors. The model deploys the experimentally obtained flow curves approach for flow-pressure correlation.
Data center operators rely on the pressure differential to measure how much the IT is provided. Hence, in this study the level of provisioning was expressed in terms of pressure differentials between the hot and cold aisles. In this manner the model reflected real-life DC thermal management practices. This was done by integrating a pressure differential based controller to the cooling unit model. Leakages in each system are quantified and ranked based on a proposed LIF (Leakage Impact Factor) metric.
The LIF describes the transport contribution each leakage location has. This metric can be used by containment designers and data center operators to prioritize their sealing efforts or consider deploying the containment solution differently. Finally, a systematic approach is shown in which containment models can be used to optimize operations at the real-life site. -
Experimental And Numerical Characterization Of A Raised Floor Data Center Using Rapid Operational Flow Curves Model
Proceedings of the ASME 2015 International Technical Conference and Exhibition on Packaging and Integration of Electronic and Photonic Microsystems InterPACK2015
As the number of data centers is exponentially growing globally, pragmatic characterization schemes are considered to be a necessity for measuring and modeling the load capacity and flow pattern of the facility. This paper contains experimental and numerical characterization of a new data center laboratory using practical measurements methods, including tiles and CRAH flow measurements. Then a full physics based CFD model is built to simulate/predict the measured data. A rapid flow curve method…
As the number of data centers is exponentially growing globally, pragmatic characterization schemes are considered to be a necessity for measuring and modeling the load capacity and flow pattern of the facility. This paper contains experimental and numerical characterization of a new data center laboratory using practical measurements methods, including tiles and CRAH flow measurements. Then a full physics based CFD model is built to simulate/predict the measured data. A rapid flow curve method is used showing high accuracy and low computational expense.
Detailed descriptions of the data center structure, dimensions, layout (Appendix, A-1) and flow devices are given. Also, modeling parameters are mentioned in details to provide a baseline for any investigative parametric or sensitivity studies. Four experimental room level flow constraint scenarios are applied at which measurements were taken, (Appendix, A-2). The model is then built and calibrated then used to predict measurements.
Measurements of the cooling unit were performed using hot wire anemometry with a traverse duct installed at the top of the CRAH. The tiles measurements were carried out using a flow hood with back pressure compensation. A detailed CFD model is constructed to predict the four experimental cases. For modeling the interdependency between the flow and pressure in flow devices flow curve approach is used. This is a rapid modeling technique that relies on experimentally measured (for IT) or approximated (for CRAH) flow curves. Applying the operational flow curves boundary conditions at the vents of the flow device results in a very accurate simulation model. It is also shown that the flow curves can be used to predict the real-time flow rate of servers at known RPM. This greatly simplifies flow rate measurements of IT in the data center.Other authors -
Experimentally Validated Numerical Model of a Fully- Enclosed Hybrid Cooled Server Cabinet
Proceedings of the ASME 2015 International Technical Conference and Exhibition on Packaging and Integration of Electronic and Photonic Microsystems InterPACK2015
Because of the rapid growth in the number of data centers combined with the high density heat dissipation in the IT and telecommunications equipment, energy efficient thermal management of data centers has become a key research focus in the electronics packaging community. Traditional legacy data centers still rely largely on chilled air flow delivered to the IT equipment racks through perforated tiles from the raised floor plenum. When there is large variation in the amount of heat dissipated…
Because of the rapid growth in the number of data centers combined with the high density heat dissipation in the IT and telecommunications equipment, energy efficient thermal management of data centers has become a key research focus in the electronics packaging community. Traditional legacy data centers still rely largely on chilled air flow delivered to the IT equipment racks through perforated tiles from the raised floor plenum. When there is large variation in the amount of heat dissipated by the racks in a given aisle, the standard air cooling approach requires over-provisioning.
Localized hybrid air-water cooling is one approach to more effectively control the cooling when there is wide variation in the amount of dissipation in neighboring racks. In a closed hybrid air-water cooled server cabinet, the generated heat is removed by a self-contained system that does not interact with the room level air cooling system. In this study, a comprehensive procedure for CFD validation in a close coupled hybrid cooled enclosed cabinet is described. The commercial enclosure has been characterized experimentally in an earlier study, where the effectiveness values were applied as boundary conditions to the compact heat exchanger model.
Here, the previously obtained experimental data are used to validate the results from computational modeling. Two cases with different air flow rates are compared. Very good agreement is achieved, with the maximum overall average error less than 4%. Due to relatively high pressure inside the cabinet, it is possible that air leakage from the cabinet may be responsible for the discrepancy between the model and experimental results. A sensitivity study was applied to the validated model to investigate the effect leakage had on the cabinet’s performance.Other authors -
Steady state and transient comparison of perimeter and row-based cooling using controlled cooling curves
Proceedings of the ASME 2015 International Technical Conference and Exhibition on Packaging and Integration of Electronic and Photonic Microsystems InterPACK2015
A brief failure scenario is then run, and duration of ride through time in the case of row-based cooling system failure is compared to raised floor perimeter cooling with containment. Non-raised floor cooling schemes will reduce the air volumetric storage of the whole facility (in this small data center cell it is about a 20% reduction). Also, the varying thermal inertia between the typical in-row and perimeter cooling units is of decisive importance.
The CFD model is validated using a new…A brief failure scenario is then run, and duration of ride through time in the case of row-based cooling system failure is compared to raised floor perimeter cooling with containment. Non-raised floor cooling schemes will reduce the air volumetric storage of the whole facility (in this small data center cell it is about a 20% reduction). Also, the varying thermal inertia between the typical in-row and perimeter cooling units is of decisive importance.
The CFD model is validated using a new data center laboratory at Binghamton University with perimeter cooling. This data center consists of one main Liebert cooling unit, 46 perforated tiles with 22% open area, 40 racks distributed on three main cold aisles C and D. A computational slice is taken of the data center to generalize results. Cold aisle C consists of 16 rack and 18 perforated tiles with containment installed. In-row coolers are then added to the CFD model. Fixed IT load is maintained throughout the simulation and steady state comparisons are built between the legacy and row-based cooling schemes. An empirically obtained flow curve method is used to capture the flow- pressure correlation for flow devices.
Performance scenarios were parametrically analyzed for the following cases: (a) Perimeter cooling in CAC, (b) In-row cooling in CAC. Results showed that in-row coolers increased the efficiency of supply air flow utilization since the floor leakage was eliminated, and higher pressure build up in CAC were observed. This reduced the rack recirculation when compared to the perimeter cooled case. However, the heat exchanger size demonstrated the limitation of the in-row to maintain controlled set point at increased air flow conditions. For the pump failure scenario, experimental data provided by Emerson labs were used to capture the thermal inertia effect of the cooling coils for in-row and perimeter unit, perimeter cooled system proved to have longer ride through time.Other authors -
Innovative Approaches of Experimentally Guided CFD Modeling for Data Center
Thermal Measurement, Modeling & Management Symposium (SEMI-THERM), 2015 31st
This study illustrates basic guidelines for integrating
empirical measurements to state of the art modeling
approaches; that can be used to improve the accuracy of
physics based data center CFD models. Experimental
measurements gathered in a new research dedicated data
center laboratory are used to guide an advanced approach to
CFD modeling, the intent is to come up with a validated
predictive model of the data center. One of the goals of this
study is to provide…This study illustrates basic guidelines for integrating
empirical measurements to state of the art modeling
approaches; that can be used to improve the accuracy of
physics based data center CFD models. Experimental
measurements gathered in a new research dedicated data
center laboratory are used to guide an advanced approach to
CFD modeling, the intent is to come up with a validated
predictive model of the data center. One of the goals of this
study is to provide scientists and engineers with a clear set of
design guidelines; that allow for the introduction of important
physical details into the model which have previously been
largely overlooked. This approach will establish a valid
baseline for future designs of data centers as well as future
research projects in the data center laboratory. The current
CFD model incorporates important physical parameters that
affect the accuracy of the results. These include the plenum
floor jacks’ hydraulic resistance, the impact of cutouts, exact
locations of supply vents, CRAH (computer room air handler)
unit blower jet effects and momentum transfer through the
perforated tiles. Other details of the CRAH unit such as the
detailed supply vents, blower specifications, turning vent and
angle of flow are included; also rack detailed structure
including the mounting rails, cable penetration and PDUs
(power distribution units). CFD modeling results are
compared to the measured tiles flow rates from several
different experimental conditions. The models were refined
based on the experimental data and comparisons of the CFD
versus experimental tiles flow data showed an overall average
error of ~3%.Other authorsSee publication -
Numerical Investigation of Underfloor Obstructions in Open-Contained Data Center with Fan Curves
Thermal and Thermomechanical Phenomena in Electronic Systems (ITherm), 2014 IEEE Intersociety Conference on
The purpose of under floor plenum in a typical raised floor
data center is to route the supply of cold air to perforated tiles
in the cold aisles, and hence, to the racks. However, the
presence of under floor chiller piping and various wiring may
have an adverse effect on flow rates if not placed based on
physical considerations; the pressure drop caused by chiller
piping and under floor blockages has not been investigated
thoroughly in modeling of a fully representative…The purpose of under floor plenum in a typical raised floor
data center is to route the supply of cold air to perforated tiles
in the cold aisles, and hence, to the racks. However, the
presence of under floor chiller piping and various wiring may
have an adverse effect on flow rates if not placed based on
physical considerations; the pressure drop caused by chiller
piping and under floor blockages has not been investigated
thoroughly in modeling of a fully representative real life data
center, and in particular, in applications where some or all of
the cold aisles may be contained. This effect on flow rate is
expected to be even more profound in contained systems; this
study aims to address the effect of under floor obstructions on
data center performance.
It was shown that when having blockages in critical
locations, containment can act as a solution for the inlet
temperatures at the racks, however, the blockage effect can
still be seen on the racks outlet and the CRAC return
temperatures. It was also observed that inadequate distribution
of those blockages led to a change in the operating point of
both the CRAC and severs fan curve reducing the flow being
fed to the IT equipment, and hence the chiller cooling load is
expected to increase, which results in a thermal deficiency of
the data center.Other authors
Patents
-
Accelerated rack acclimator
US 10667435
-
CARBON CAPTURE SYSTEM
US-20230201757
-
Control systems and prediction methods for IT cooling performance in containment
US 15879163
-
Control systems and prediction methods for it cooling performance in containment
US-11076509-B2
-
Disaggregated computer systems
US-11567548-B2
-
Disaggregated computer systems
US-11392184-B2
-
Energy regeneration in fuel cell-powered datacenter with thermoelectric generators
US 10,998,483
-
Immersion Cooling Infrastructure Module Having Compute Device Form Factor
US 10750637
-
Immersion cooling enclosures with insulating liners
US-11051426-B2
-
Immersion cooling enclosures with insulating liners
US 10765033
-
Immersion cooling enclosures with insulating liners 2
US 11,051,426
-
Internal Server Air Quality Station
US 16161044
-
Power supply mode selection
US 15957675
-
Scalable thermal ride-through for immersion-cooled server systems
US-11357130-B2
-
Self-contained immersion cooling server assemblies
US 10,925,188
-
Self-contained immersion cooling server assemblies
US-11483949-B2
-
Server testing applying controlled flow and temperature
US 15969634
-
Systems and methods for cooling a computing system
US-11675400-B2
-
Systems and methods for immersion-cooled datacenters
US-11606878-B2
-
Systems and methods for immersion-cooled datacenters
US-11533829-B2
-
Thermally conductive microtubes for evenly distributing heat flux on a cooling system
US-11653475-B2
Honors & Awards
-
IEEE Micro Top Picks 2021
IEEE Micro
https://ieeexplore.ieee.org/abstract/document/9749826
-
DCD Global Awards 2021 | Mission Critical Tech Innovation Award
DCD Global Awards
Microsoft Cloud-Production Two Phase Immersion Cooling Deployment
-
CO+I Culture Champion
Microsoft’s Cloud Operations and Innovation CO+I
-
Outstanding Paper Award - ASME International Technical Conference on Packaging and Integration of Electronic and Photonic Microsystems
ASME 2018 InterPACK, San Francisco CA.
Impact of Internal Design on the Efficiency of IT equipment in a Hot Aisle Containment System - An Experimental Study
-
Early Career Achievement Award
New York State Assembly
-
Best Paper Award - IEEE Electronics Packaging Society, Transactions on Components, Packaging and Manufacturing Technology
CPMT IEEE, ECTC, Orlando, FL.
Chip to Chiller Experimental Cooling Failure Analysis of Data Centers: The Interaction Between IT and Facility
-
Best Paper in the Area of Energy-Smart Electronic Systems Technology and/or Application Presented at SEMI-THERM 33
SEMITHERM, SAN JOSE, CA
Impact of Elevated Temperature on Data Center Operation Based on Internal and External IT Instrumentation
-
Distinguished Dissertation Award
S3IP A New York Center of Excellence Binghamton University
Chip to Chiller Energy-Reliability Aware Empirical and Numerical Analyses of the Integrated Data Center
-
POSTER CONTEST WINNER
IAB [Industrial Advisory Board] Meeting, NJ- ES2 Center for Energy-Smart Electronic Systems
Experimentally Validated Numerical Model of a Fully-Enclosed Hybrid Cooled Server Cabinet
-
POSTER CONTEST WINNER
IAB [Industrial Advisory Board] Meeting, UT Arlington, TX- ES2 Center for Energy-Smart Electronic Systems
Experimentally Guided Advances of CFD Modeling of Air-cooled Data Centers in a Raised Floor Setting
Organizations
-
IEEE
Member
- Present -
ASHRAE TC9.9
Member
- Present -
OCP
-
- Present -
ASME
Member
-
Electronics Packaging Society (EPS) - IEEE
-
-
The Green Grid
-
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More