pveclib.github.io

Repository of pveclib documentation for current releases

PVECLIB is a set of header files that contain useful functions leveraging the PowerISA Vector Facilities: Vector Multimedia Extension (VMX, also known as Altivec) and Vector Scalar Extension (VSX). Some functions, such as quadword multiply and multiple-quadword multiply and multiply-add, are large enough to justify CPU-specific, tuned run-time libraries. The user can choose to bind to platform-specific static archives or to dynamic shared object libraries, which automatically select the correct implementation for the CPU they are running on (dynamic linking resolves this through IFUNC).

The goal of this project is to provide well-crafted implementations of useful vector and large-number operations:

Provide equivalent functions across versions of the PowerISA. For example, the Vector Multiply-by-10 Unsigned Quadword operations introduced in PowerISA 3.0 (POWER9) can be implemented in a few vector instructions on earlier PowerISA versions.
Provide equivalent functions across versions of the compiler. For example, builtins provided in later versions of the compiler can be implemented as inline functions with inline asm in earlier compiler versions.
Provide higher-order functions not provided directly by the PowerISA. One example is vector SIMD implementations of ASCII __isalpha and similar functions. Another is full __int128 implementations of Count Leading Zeros, Population Count, and Multiply.
Provide optimized run-time libraries for quadword integer multiply and multi-quadword integer multiply and add.
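
To make the goals above concrete, here is a minimal scalar sketch of three of the operations mentioned: multiply-by-10 with its carry digit, count leading zeros, and population count on a 128-bit value. It is not PVECLIB code and uses no vector instructions. All `sketch_` names are hypothetical, and it assumes a compiler that offers the `unsigned __int128` extension and the `__builtin_clzll` / `__builtin_popcountll` builtins (GCC or Clang on a 64-bit target).

```c
#include <assert.h>
#include <stdint.h>

/* Illustrative names only; none of these are PVECLIB functions. */
typedef unsigned __int128 sketch_u128; /* GCC/Clang extension, 64-bit targets */

/* Low 128 bits of 10*x, built from two shifts and one add: 10*x = 8*x + 2*x. */
static inline sketch_u128 sketch_mul10(sketch_u128 x)
{
    return (x << 3) + (x << 1);
}

/* Digit (0..9) carried out of bit 127 when x is multiplied by 10. */
static inline unsigned sketch_mul10_carry(sketch_u128 x)
{
    sketch_u128 x8 = x << 3;          /* low 128 bits of 8*x  */
    sketch_u128 sum = x8 + (x << 1);  /* low 128 bits of 10*x */
    /* Bits shifted out by the two shifts ... */
    unsigned spill = (unsigned)(x >> 125) + (unsigned)(x >> 127);
    /* ... plus the carry out of the 128-bit add (detected by wraparound). */
    return spill + (unsigned)(sum < x8);
}

/* Count leading zeros of a 128-bit value from its two 64-bit halves. */
static inline int sketch_clz(sketch_u128 x)
{
    uint64_t hi = (uint64_t)(x >> 64);
    uint64_t lo = (uint64_t)x;
    if (hi != 0)
        return __builtin_clzll(hi);
    if (lo != 0)
        return 64 + __builtin_clzll(lo);
    return 128; /* the builtin is undefined for 0, so handle it here */
}

/* Population count of a 128-bit value: sum of the counts of both halves. */
static inline int sketch_popcount(sketch_u128 x)
{
    return __builtin_popcountll((uint64_t)(x >> 64))
         + __builtin_popcountll((uint64_t)x);
}

/* Small usage example with hand-checked values. */
void sketch_demo(void)
{
    sketch_u128 x = (sketch_u128)1 << 100;       /* only bit 100 is set */
    assert(sketch_clz(x) == 27);                 /* 127 - 100 */
    assert(sketch_popcount(x) == 1);
    assert(sketch_mul10(12345) == 123450);
    assert(sketch_mul10_carry(~(sketch_u128)0) == 9);
}
```

The point of the sketch is the decomposition: each 128-bit operation reduces to a handful of narrower operations, which is the same idea that lets an operation be provided on processors that lack a single instruction for it.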

Most PVECLIB operations are static inline implementations provided by PVECLIB header files. The headers are organized by element type:

vec_common_ppc.h; Typedefs and helper macros
vec_f128_ppc.h; Operations on vector _Float128 values
vec_f64_ppc.h; Operations on vector double values
vec_f32_ppc.h; Operations on vector float values
vec_int512_ppc.h; Operations on Multi-quadword integer values
vec_int128_ppc.h; Operations on vector __int128 values
vec_int64_ppc.h; Operations on vector long int (64-bit) values
vec_int32_ppc.h; Operations on vector int (32-bit) values
vec_int16_ppc.h; Operations on vector short int (16-bit) values
vec_char_ppc.h; Operations on vector char (8-bit) values
vec_bcd_ppc.h; Operations on vectors of Binary Coded Decimal and Zoned Decimal values

PVECLIB now (v1.0.4) supports CPU-tuned run-time libraries, both static archives and dynamic (IFUNC-selected) shared objects. Currently this runtime supports the multiple-quadword multiply and multiply-add operations documented in the vec_int512_ppc.h header.
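
The idea behind run-time selection can be sketched in plain C. The example below is an analogy, not PVECLIB's mechanism: it replaces IFUNC with an ordinary function pointer that is resolved on first call, and the "tuned" variant is only a stand-in that reuses the generic code. All `sketch_` names are hypothetical. The CPU probe assumes GCC's `__builtin_cpu_supports("arch_3_00")` on Linux/Power and simply reports "not available" everywhere else.

```c
#include <assert.h>
#include <stdint.h>

typedef unsigned __int128 sketch_u128;          /* GCC/Clang extension */
typedef struct { uint64_t w[4]; } sketch_u256;  /* w[0] is least significant */
typedef sketch_u256 (*sketch_mul_fn)(sketch_u128, sketch_u128);

/* Generic 128x128 -> 256-bit multiply from four 64x64 partial products. */
static sketch_u256 sketch_mul_generic(sketch_u128 a, sketch_u128 b)
{
    uint64_t a0 = (uint64_t)a, a1 = (uint64_t)(a >> 64);
    uint64_t b0 = (uint64_t)b, b1 = (uint64_t)(b >> 64);
    sketch_u128 p00 = (sketch_u128)a0 * b0, p01 = (sketch_u128)a0 * b1;
    sketch_u128 p10 = (sketch_u128)a1 * b0, p11 = (sketch_u128)a1 * b1;
    /* Column sums; neither can overflow 128 bits. */
    sketch_u128 mid = (p00 >> 64) + (uint64_t)p01 + (uint64_t)p10;
    sketch_u128 hi = (mid >> 64) + (p01 >> 64) + (p10 >> 64) + p11;
    sketch_u256 r = { { (uint64_t)p00, (uint64_t)mid,
                        (uint64_t)hi, (uint64_t)(hi >> 64) } };
    return r;
}

/* Stand-in for a CPU-tuned variant; a real one would use newer instructions. */
static sketch_u256 sketch_mul_tuned(sketch_u128 a, sketch_u128 b)
{
    return sketch_mul_generic(a, b);
}

/* Assumption: GCC on Linux/Power exposes the ISA 3.0 check below. */
static int sketch_cpu_has_isa300(void)
{
#if defined(__powerpc64__) && defined(__GNUC__) && !defined(__clang__)
    return __builtin_cpu_supports("arch_3_00");
#else
    return 0;
#endif
}

/* Plays the role of an IFUNC resolver: pick one implementation. */
static sketch_mul_fn sketch_resolve_mul(void)
{
    return sketch_cpu_has_isa300() ? sketch_mul_tuned : sketch_mul_generic;
}

/* Public entry point: resolve once, then always call the chosen variant.
   (Not synchronized; a real library lets the dynamic linker do this.) */
sketch_u256 sketch_mul_u128(sketch_u128 a, sketch_u128 b)
{
    static sketch_mul_fn impl; /* NULL until the first call */
    if (!impl)
        impl = sketch_resolve_mul();
    return impl(a, b);
}

/* Usage: (2^128 - 1)^2 = 2^256 - 2^129 + 1. */
void sketch_mul_demo(void)
{
    sketch_u256 r = sketch_mul_u128(~(sketch_u128)0, ~(sketch_u128)0);
    assert(r.w[0] == 1 && r.w[1] == 0);
    assert(r.w[2] == 0xFFFFFFFFFFFFFFFEull && r.w[3] == 0xFFFFFFFFFFFFFFFFull);
}
```

With IFUNC the resolver runs once at dynamic-link time and callers bind directly to the chosen function, so the per-call pointer check in this sketch disappears.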

The current PVECLIB implementation assumes the target supports both the VMX (Altivec) and VSX facilities, so the minimum targets are set internally (PVECLIB_DEFAULT_CFLAG) to '-mcpu=power7' for big endian (BE) and '-mcpu=power8' for little endian (LE).
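
The rule above can be restated as a small C sketch. This is not how PVECLIB sets PVECLIB_DEFAULT_CFLAG; it only shows the same decision in code. The `sketch_` helpers are hypothetical. The macros `__ALTIVEC__`, `__VSX__`, `__BYTE_ORDER__`, and `__ORDER_LITTLE_ENDIAN__` are predefined by GCC and Clang, and the guard assumes those compilers.

```c
#include <assert.h>
#include <string.h>

/* Compile-time guard: on a Power target, insist on both vector facilities.
   __ALTIVEC__ and __VSX__ are predefined when the facilities are enabled. */
#if (defined(__powerpc__) || defined(__powerpc64__)) && \
    !(defined(__ALTIVEC__) && defined(__VSX__))
#error "This sketch expects a target with both VMX (Altivec) and VSX"
#endif

/* Hypothetical helper: the minimum -mcpu value for each endianness. */
static inline const char *sketch_min_mcpu(int little_endian)
{
    return little_endian ? "-mcpu=power8" : "-mcpu=power7";
}

/* Returns 1 when compiled for a little-endian target, else 0. */
static inline int sketch_is_little_endian(void)
{
#if defined(__BYTE_ORDER__) && defined(__ORDER_LITTLE_ENDIAN__)
    return __BYTE_ORDER__ == __ORDER_LITTLE_ENDIAN__;
#else
    const unsigned one = 1; /* fallback: inspect the first byte at run time */
    return *(const unsigned char *)&one == 1;
#endif
}

/* Usage: the two endian cases map to the two documented minimums. */
void sketch_mcpu_demo(void)
{
    assert(strcmp(sketch_min_mcpu(0), "-mcpu=power7") == 0); /* BE */
    assert(strcmp(sketch_min_mcpu(1), "-mcpu=power8") == 0); /* LE */
}
```

Calling `sketch_min_mcpu(sketch_is_little_endian())` gives the minimum that applies to the target being compiled for.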

The default compiler is 'gcc'. The project can be configured to use the Clang/LLVM compiler by setting CC=clang.
