Skip to content
4ip_extention.rst 7.91 KiB
Newer Older
Victor's avatar
Victor committed
4IP-extention: Run on PCP machines
==================================

This document will describe the work done under the PRACE-4IP extention. This task is dedicated to run the accelerated UEABS on PCP systems to obtain energy metrics on OpenPower+GPU, Xeon Phi and FPGA.

Organisation is publicly available to anyone interrested in the project and rely on the following tools:
Victor's avatar
Victor committed
 * A slack channel for chat purpose: `PRACE PCP slack channel`_ , ask `Victor Cameo Ponz`_ for registration link.
Victor's avatar
Victor committed
 * A mailing list: ``TODO, adress & registration process``
 * This documentation.

The following methodology will be followed to lead this task:
 #. define which codes of UEABS will be run by who on which machines (this also meats the PRACE milestone ``TODO, name and PRACE calendar due`` due by August 2017)
 #. grant access to machines (cut off September 2017 )
 #. run benchmarks (cut off October/November)
 #. gather results and report (this adresses the PRACE deliverable ``TODO, name and PRACE calendar due`` due by December 2017)

Code definition
---------------

The table :ref:`table-code-definition` shows all codes availabe with UEABS (regular and accelerated). It states for each codes, tageted architechures and BCOs.

.. _table-code-definition:
.. table:: Code definition
   :widths: auto

Victor's avatar
Victor committed
   +------------------------+--------------------------------+-----------------------------+
   |                        |           Will run on          |                             |
   |   Code name            +--------------+----------+------+   4IP-extention BCO         +
   |                        | Power8 + GPU | Xeon Phi | FPGA |                             |
   +========================+==============+==========+======+=============================+
   | ALYA                   |              |          |      |                             |
   +------------------------+--------------+----------+------+-----------------------------+
   | Code_Saturne           |              |          |      | ???Charles Moulinec???      |
   +------------------------+--------------+----------+------+-----------------------------+
   | CP2K                   |              |          |      | Arno Proeme                 |
   +------------------------+--------------+----------+------+-----------------------------+
   | GADGET                 |              |          |      |                             |
   +------------------------+--------------+----------+------+-----------------------------+
   | GENE                   |              |          |      |                             |
   +------------------------+--------------+----------+------+-----------------------------+
   | GPAW                   |              |          |      | Martti Louhivuori (CINCA)   |
   +------------------------+--------------+----------+------+-----------------------------+
   | GROMACS                |              |          |      | Dimitris Dellis (GRNET)     |
   +------------------------+--------------+----------+------+-----------------------------+
   | NAMD                   |              |          |      | Dimitris Dellis (GRNET)     |
   +------------------------+--------------+----------+------+-----------------------------+
   | NEMO                   |              |          |      | ???EPCC???                  |
   +------------------------+--------------+----------+------+-----------------------------+
   | PFARM                  |              |          |      |                             |
   +------------------------+--------------+----------+------+-----------------------------+
   | QCD                    |              |          |      | Arno Proeme (EPCC)          |
   +------------------------+--------------+----------+------+-----------------------------+
   | Quantum Espresso       |              |          |      | Andrew Emerson (CINECA)     |
   +------------------------+--------------+----------+------+-----------------------------+
   | SHOC                   |              |          |      | Valeriu Codreanu (SurfSARA) |
   +------------------------+--------------+----------+------+-----------------------------+
   | Specfem3D_Globe        |    ✓         | ✓        |      | Victor Cameo Ponz (CINES)   |
   +------------------------+--------------+----------+------+-----------------------------+
Victor's avatar
Victor committed


PCP systems description and registration process
------------------------------------------------

This section describes the systems where BCOs have been granted access. **The registration process included whithin is aimed at BCOs only.**

The table :ref:`table-system-availability` sums up the when registration process start:

.. _table-system-availability:
.. table:: System availability
   :widths: auto

   +--------------+----------------------------+---------------+------------------------------------------------------+
   |  Technology  | Manufacturer               | Host          | Availability                                         |
   +==============+============================+===============+======================================================+
   | Power8 + GPU | `E4 computer engineering`_ | CINECA_ (It)  | Start migration of nodes to CINECA in June/July 2017 |
   +--------------+----------------------------+---------------+------------------------------------------------------+
   | Xeon Phi     | `Atos/Bull`_               | CINES_ (Fr)   | June 2017                                            |
   +--------------+----------------------------+---------------+------------------------------------------------------+
   | FPGA         | MAXELER_                   | JSC_ (De)     | August 2017                                          |
   +--------------+----------------------------+---------------+------------------------------------------------------+

Power8 + GPU
^^^^^^^^^^^^

This machine has been designed by `E4 computer engineering`_ and is hosted at CINECA_ in Bologna, Italy. In order to access the machine BCO should register on the `CINECA user datatabase`_ and ask `Victor Cameo Ponz`_ to be added to the 4IP-extention project.

Compute technology
""""""""""""""""""

Hardware features fat-nodes with the following design:
Victor's avatar
Victor committed
 * 45 nodes with x2 IBM POWER8 processors and x4 NVIDIA P100 GPU
 * intranode comunications integrated using NVLink
 * extranode comunications integrated using Infiniband ERD interconnect
Victor's avatar
Victor committed
 * CPU and GPU liquid cooling based on CoolIT_ solution

Energy sampling technology
""""""""""""""""""""""""""

Xeon Phi
^^^^^^^^

This machine has been designed by `Atos/Bull`_ and is hosted at CINES_ in Montpellier, France. In order to access the machine BCO should fill the request to open login form (`odt <cines-login-form-odt>`_ or `rtf <cines-login-form-rtf>`_) and send it back to `Victor Cameo Ponz`_.

Compute technology
""""""""""""""""""
Hardware features the following nodes:
Victor's avatar
Victor committed
 * 168 nodes with 1x Intel Xeon Phi 7250 processor (KNL)
 * Liquid cooled nodes and PSU
 * MooseFS I/O
Victor's avatar
Victor committed

Energy sampling technology
""""""""""""""""""""""""""

Victor's avatar
Victor committed

Victor's avatar
Victor committed
FPGA
^^^^

This machine has been designed by MAXELER_ and is hosted at JSC_ in Julich, Germany.

Compute technology
""""""""""""""""""

Victor's avatar
Victor committed
This small pilot system features:
 - 4 MPC-H servers including 2x MAX5 DFE and 2x Intel Xeon processors

Victor's avatar
Victor committed
Energy sampling technology
""""""""""""""""""""""""""



.. _Victor Cameo Ponz: cameo+4ip-extention@cines.fr

.. _PRACE PCP slack channel: https://prace-pcp.slack.com

.. _E4 computer engineering: https://www.e4company.com
.. _Atos/Bull: https://bull.com/
.. _MAXELER: http://maxeler.com/

.. _CINECA: http://hpc.cineca.it/
.. _CINES: https://www.cines.fr/
.. _JSC: http://www.fz-juelich.de/ias/jsc/EN/Home/home_node.html

.. _CINECA user datatabase: https://userdb.hpc.cineca.it/
.. _cines-login-form-odt: https://www.cines.fr/wp-content/uploads/2014/01/opening_renewal_login_2017.odt
.. _cines-login-form-rtf: https://www.cines.fr/wp-content/uploads/2014/01/opening_renewal_login_2017.rtf
Victor's avatar
Victor committed

.. _CoolIT: https://www.coolitsystems.com/
.. _Slurm: https://slurm.schedmd.com/