Strumenti Utente

Strumenti Sito


roberto.alfieri:pub:vectorization

Differenze

Queste sono le differenze tra la revisione selezionata e la versione attuale della pagina.

Link a questa pagina di confronto

Entrambe le parti precedenti la revisioneRevisione precedente
Prossima revisione
Revisione precedente
roberto.alfieri:pub:vectorization [13/06/2017 19:45] โ€“ roberto.alfieriroberto.alfieri:pub:vectorization [14/06/2017 11:30] (versione attuale) โ€“ roberto.alfieri
Linea 1: Linea 1:
 ====== Vectorization ====== ====== Vectorization ======
 +
 +[[ http://www.training.prace-ri.eu/uploads/tx_pracetmo/intel_mic_optimization.pdf | Vectorization and Code Optimization ]]
 +
  
 Processor peak performance includes the speed-up provided by the vector instructions,  Processor peak performance includes the speed-up provided by the vector instructions, 
Linea 13: Linea 16:
  
 {{:roberto.alfieri:pub:waytovectorize.png?200|}} {{:roberto.alfieri:pub:waytovectorize.png?200|}}
 +
 +
 +
 +Auto-vectorization is the easiest and more portable way to get vectorization.
 +
 +The compiler recognize several vectiorization options.
 +
 +Main vectorization options: 
 +
 +^      Intel compiler   ^  
 +^ KNL      | -xMIC-AVX512 
 +^ BDW      | -xCORE-AVX2   |
 +^ Disable  | -no-vec       |
 +
 +Not all loops can be vectorized:
 +
 +Some examples: 
 +
 +  * Loop with dependencies between iterations
 +
 +<code>
 +for (i=1; i<MAX; i++) {
 +   d[i] = e[i] โ€“ a[i-1];
 +   a[i] = b[i] + c[i];
 +}
 +</code>
 +
 +
 +  * Complex loops
 +
 +  * Function calls inside the loop:
 +
 +      for (int i = 0; i < N; i++)   a[i] = foo(b[i]);
 +
 +  * Loops on data that are not aligned in memory    
 +  
roberto.alfieri/pub/vectorization.1497375933.txt.gz ยท Ultima modifica: 13/06/2017 19:45 da roberto.alfieri

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki