Commit 3dcd2ba0 authored by Victor's avatar Victor
Browse files

NEW knl informations

parent b23088f2
Xeon Phi
^^^^^^^^
This machine has been designed by `Atos/Bull`_ and is hosted at CINES_ in Montpellier, France.
This machine has been designed by `Atos/Bull`_ and is hosted at CINES_ in Montpellier, France. It is made of 76 Bull Sequana X1210 blades, each including 3 Xeon Phi KNL nodes. It totals a theoretical peak performance of 465 Tflop/s with an estimated consumption of 42kW.
.. note::
In order to access the machine BCOs should fill the CINES login opening form: `odt <cines-login-form-odt_>`_ or `rtf <cines-login-form-rtf_>`_.
In order to access the machine BCOs should fill the `GENCI login opening form`_.
Use the following information to fill project related fields:
- project outside DARI
......@@ -19,14 +19,25 @@ This machine has been designed by `Atos/Bull`_ and is hosted at CINES_ in Montpe
Compute technology
""""""""""""""""""
Hardware features the following nodes:
* 168 nodes with 1x Intel Xeon Phi 7250 processor (KNL)
* Liquid cooled nodes and PSU
* 168 nodes with
* 1x Intel Xeon Phi 7250 processor (KNL), 68 cores cadenced to 1.4 GHz with SMT 4.
* 96GB memory, 16GBx6 DDR4 DIMMs
* intranode comunications integrated using InfiniBand EDR
* 100% Hot water cooled nodes
* Half of the configuration feature liquid cooled Power Supply Unit (PSU) make this part of the machine 100% liquid cooled.
* MooseFS I/O
Each compute node has a theoritical peak performance of 2.765 TFlop/s (double precision) and a power consumption of less than 250W.
Energy sampling technology
""""""""""""""""""""""""""
Current and voltage sensors based on Hall-effect sensor with a linear sensitivity in range 0-200 A. The sampling occurs at the node level at up to the frequency of 50 kHz and is provided through a HDEEM FPGA on each node.
.. _GENCI login opening form: https://www-dcc.extra.cea.fr/CCFR/
.. _cines-login-form-odt: https://www.cines.fr/wp-content/uploads/2014/01/opening_renewal_login_2017.odt
.. _cines-login-form-rtf: https://www.cines.fr/wp-content/uploads/2014/01/opening_renewal_login_2017.rtf
.. _Atos/Bull: https://bull.com/
......
Performance and energy metrics on PCP systems - D7.7
====================================================
.. _d77:
Deliverable 7.7: Performance and energy metrics on PCP systems
==============================================================
Introduction
************
......@@ -20,7 +22,7 @@ Access to machines
Performances and energy metrics of EUABS on PCP systems
****************************
*******************************************************
intro: ref to previous D7.5 & EUABS
mix of the two. explain that some are newly ported to accelerators
......
Power8 + GPU
^^^^^^^^^^^^
D.A.V.I.D.E has been designed by `E4 computer engineering`_ and is hosted at CINECA_ in Bologna, Italy. It totals a theoritical peak performance of 990 TFlops and an estimated power consumption of less than 90kW. A more detailed description can be found on the `E4 dedicated webpage`_.
D.A.V.I.D.E has been designed by `E4 computer engineering`_ and is hosted at CINECA_ in Bologna, Italy. It totals a theoritical peak performance of 990 TFlop/s (double precision) and an estimated power consumption of less than 100kW. A more detailed description can be found on the `E4 dedicated webpage`_.
.. note:: In order to access the machine BCO should send an email to `Victor Cameo Ponz`_ so that.
......@@ -11,13 +11,13 @@ Compute technology
Hardware features fat-nodes with the following design:
* 45 nodes with
* x2 IBM POWER8+ processors, ie 8x2 cores with Simultaneous Multi-Threading (SMT) 8
* x4 NVIDIA P100 GPU with 16Go High Bandwidth Memory 2 (HBM2)
* x4 NVIDIA P100 GPU with 16GB High Bandwidth Memory 2 (HBM2)
* intranode comunications integrated using NVLink
* extranode comunications integrated using Infiniband ERD interconnect in fat-tree with no oversubscription topology
* CPU and GPU direct hot water (~27°C) cooling, removing 75-80% of the total heat
* remaining heat is air-cooled
* remaining 20-25% heat is air-cooled
Each compute node has a theoritical peak performance of 22 TFLOPS (double precision) and a power consumption of less than 2kW.
Each compute node has a theoritical peak performance of 22 Tflop/s (double precision) and a power consumption of less than 2kW.
Energy sampling technology
""""""""""""""""""""""""""
......
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment