Showing 20 open source projects for "simd"

  • 1
    XNNPACK

    High-efficiency floating-point neural network inference operators

    ...Rather than serving as a standalone ML framework, XNNPACK provides high-performance computational primitives (such as convolutions, pooling, activation functions, and arithmetic operations) that are integrated into higher-level frameworks like TensorFlow Lite, PyTorch Mobile, ONNX Runtime, TensorFlow.js, and MediaPipe. The library is written in C/C++ and designed for portability and performance, using platform-specific SIMD instruction sets (e.g., NEON, AVX, WebAssembly SIMD) for optimized execution. It supports NHWC tensor layouts and allows flexible striding along the channel dimension, so channel-split and concatenation operations come at no additional cost.
    Downloads: 2 This Week
  • 2
    OpenGL Mathematics

    Highly Optimized Graphics Math (glm) for C

    Highly optimized 2D|3D math library, also known as OpenGL Mathematics (glm) for `C`. cglm provides many utilities that make math operations fast and quick to write. It is community-friendly: feel free to report any issues or bugs you encounter. Almost all functions (inline versions) and parameters are documented inside the corresponding headers. OpenGL-related functions are dropped to make this library independent of platforms and third parties. Make sure you have the latest version and feel free to report...
    Downloads: 1 This Week
  • 3

    UniSIMD-assembler

    SIMD macro assembler unified for ARM, MIPS, PPC and x86

    UniSIMD assembler is a high-level C/C++ macro assembler framework unified across ARM, MIPS, POWER and x86 architectures. It establishes a subset of both BASE and SIMD instruction sets with a clearly defined common API, so that application logic can be written and maintained in one place without code replication. The assembler itself isn't a separate tool, but rather a collection of C/C++ header files that applications include directly. At present, the following targets are mostly implemented (with horizontal reductions): Intel SSE/SSE2/SSE4 and AVX/AVX2/AVX-512 (32/64-bit x86 ISAs); ARMv7 NEON/NEONv2, ARMv8 AArch32 and AArch64 NEON, and SVE (32/64-bit ARM ISAs); MIPS 32/64-bit r5/r6 MSA; and POWER 32/64-bit VMX/VSX (little/big-endian ISAs). Scalar improvements and wider SIMD vectors with zeroing/merging predicates in 3/4-operand instructions are planned as extensions to the current 2/3-operand SPMD-driven vertical SIMD ISA. ...
    Downloads: 1 This Week
  • 4
    Vector Pascal is a language targeted at SIMD multi-core instruction sets such as AVX and SSE2, or x86-64-v3. Its SIMD compiler supports parallel vector operations, loop unrolling, common subexpression elimination, etc. It is implemented in Java.
    Downloads: 8 This Week
  • 5
    sleef

    Vectorized libm

    SLEEF stands for SIMD Library for Evaluating Elementary Functions. SLEEF implements vectorized versions of all C99 real floating-point math functions, which use the SIMD instructions of modern processors to make computation more efficient. The library also includes vectorized DFT subroutines.
    Downloads: 0 This Week
  • 6
    LinAsm

    Collection of fast and optimized assembly libraries for x86-64 Linux

    LinAsm is a collection of very fast, SIMD-optimized libraries written in assembly for x86-64 Linux. It implements many common and widely used algorithms for array manipulation: searching, sorting, arithmetic and vector operations, unit conversions; fast mathematical and statistical functions; number and time conversion algorithms; finite impulse response (FIR) digital filters; spectrum analysis algorithms and the fast Hartley transform; CPU cache-friendly functions and extremely fast abstract data types (ADT) such as hash tables and b-trees, and much more. ...
    Downloads: 19 This Week
  • 7
    libjpeg-turbo

    SIMD-accelerated libjpeg-compatible JPEG codec library

    libjpeg-turbo is a JPEG image codec that uses SIMD instructions (MMX, SSE2, NEON, AltiVec) to accelerate baseline JPEG compression and decompression on x86, x86-64, ARM, and PowerPC systems. On such systems, libjpeg-turbo is generally 2-6x as fast as libjpeg, all else being equal. On other types of systems, libjpeg-turbo can still outperform libjpeg by a significant amount, by virtue of its highly-optimized Huffman coding routines.
    Downloads: 45,822 This Week
  • 8
    TurboPFor

    Fastest Integer Compression

    Fastest Integer Compression. All functions are available for AMD/Intel, 64-bit ARMv8 NEON (Linux and macOS/M1), and Power9 AltiVec. 100% C (C++ headers), as simple as memcpy. OS: Linux (amd64, arm64, Power9) and macOS (AMD/Intel and Apple M1).
    Downloads: 0 This Week
  • 9

    smartIDS

    Lightweight intrusion detection for IoT and embedded devices.

    The aim of the project is a lightweight intrusion detection library for embedded devices that supports MSP430 and ARM Cortex based devices. Features include DSP/SIMD support, IoT and embedded protocols, distributed operation, event and history management, and tool-supported configuration and visualization. There is a Java port that supports fewer features.
    Downloads: 0 This Week
  • 10
    jpegant

    Embedded JPEG encoder

    ...Release 1.1 source code and Windows executables are on the download page. For the latest release code, look in the 'release-1-0' branch of the repository. The SSE2 implementation is in the 'simd.0' branch of the repository.
    Downloads: 0 This Week
  • 11
    DD-AVX

    DD-AVX: Library of high-precision operations accelerated by AVX

    ...Author's page: http://www.slis.tsukuba.ac.jp/~s1530534/index.html. E-mail: hishinuma@slis.tsukuba.ac.jp. Double-Double (DD) precision operations are used to reduce rounding errors and improve the convergence of Krylov subspace methods. This library provides Double-Double precision operations accelerated by AVX and AVX2, which are Intel SIMD instruction sets that perform four double-precision operations simultaneously. To obtain DD-precision iterative solvers, the library must be merged with "Lis"; the officially supported Lis version is "lis-1.4.58". After merging Lis and DD-AVX, the interface is the same as that of Lis. Lis web page: http://www.ssisc.org/lis/index.en.html. This library is managed with git; older versions can be found on the "code" page.
    Downloads: 2 This Week
  • 12

    libsombrero

    Astronomical object/structure detection from 1D and 2D data sets.

    Sombrero is a fast wavelet image processing and object detection C library for astronomical images. Sombrero is named after the "Mexican Hat" shape of the wavelet masks used in image convolution and is released under the GNU LGPL.
    Downloads: 0 This Week
  • 13
    Mathematical library utilising SIMD features of common processors to accelerate many commonly-used algorithms where compilers fear to tread.
    Downloads: 0 This Week
  • 14
    Virtual Lighttable and Darkroom
    Darktable is a virtual lighttable and darkroom for photographers: it manages your digital negatives in a database and lets you view them through a zoomable light table. It also enables you to develop raw images and enhance them.
    Downloads: 19 This Week
  • 15
    An efficient implementation of the Smith-Waterman algorithm that takes advantage of SIMD instruction sets in modern CPUs. The Smith-Waterman algorithm is used for sequence alignment in bioinformatics.
    Downloads: 0 This Week
  • 16
    SSEPlus is a SIMD function library. It provides optimized emulation for newer SSE instructions. It also provides a rich set of high performance routines for common operations such as arithmetic, bitwise logic, and data packing and unpacking.
    Downloads: 0 This Week
  • 17
    Low-level processor benchmark for x86 and amd64 processors: measures exact latency and throughput for each assembly instruction, and automatically finds execution units. Special focus on SIMD (MMX, SSE) instructions.
    Downloads: 1 This Week
  • 18
    This library is meant for high-performance calculations in science or 3D games/rasterizers, using the SIMD instructions of x86 processors to allow an unparalleled level of optimization. It takes advantage of MMX, 3DNow!, 3DNow!+/MMX+, and SSE/SSE2/SSE3/SSSE3.
    Downloads: 0 This Week
  • 19
    A cross-platform, cross-compiler, cross-CPU C header library for programming with SIMD instruction sets. Supported targets are x86 (MMX/SSE/SSE2) with GCC and MSVC, PPC AltiVec with GCC, WMMX ARM with GCC, and software-emulated SIMD.
    Downloads: 0 This Week
  • 20
    SIMD-oriented Fast Mersenne Twister (SFMT) random number generator for the Cell processor. This library is an implementation of SFMT (http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/SFMT/). The objective is to create a fast pseudo-random generator for the Cell processor.
    Downloads: 0 This Week
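The DD-AVX entry above refers to Double-Double (DD) precision. As a rough illustration of the idea (not the library's code or API), a value can be stored as an unevaluated sum `hi + lo` of two doubles, with the rounding error of each addition recovered by Knuth's TwoSum. The sketch below is scalar Python; the names `two_sum` and `dd_add` are hypothetical, and `dd_add` is a simplified variant without the tighter error bounds a production library would aim for.

```python
def two_sum(a, b):
    """Knuth's TwoSum: return (s, e) where s is the rounded sum a + b
    and e is the rounding error, so that a + b == s + e exactly."""
    s = a + b
    bb = s - a
    e = (a - (s - bb)) + (b - bb)
    return s, e


def dd_add(x, y):
    """Add two double-double values given as (hi, lo) pairs.

    Simplified illustrative variant: the low parts are folded into the
    error term of the high-part sum, then the pair is renormalized.
    """
    s, e = two_sum(x[0], y[0])
    e += x[1] + y[1]
    return two_sum(s, e)


# Plain doubles lose the small term entirely.
plain = (1.0 + 1e-20) - 1.0

# The double-double pair keeps it in the low word until it matters.
dd = dd_add(dd_add((1.0, 0.0), (1e-20, 0.0)), (-1.0, 0.0))
print(plain, dd)
```

A SIMD version would apply the same sequence of additions and subtractions to several lanes at once, which is where wide vector registers help.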
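Entry 15 above names the Smith-Waterman algorithm for sequence alignment. For orientation, here is a minimal scalar sketch of its local-alignment score recurrence; SIMD implementations vectorize this same dynamic-programming table. This is not the listed project's code: the function name and the scoring values (match +2, mismatch -1, linear gap penalty -1) are assumptions chosen only for illustration.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-1):
    """Return the best local-alignment score between strings a and b.

    Only two rows of the dynamic-programming table are kept, since each
    cell depends on its left, upper, and upper-left neighbours.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            # Clamping at 0 is what makes the alignment local.
            curr[j] = max(0, diag, up, left)
            best = max(best, curr[j])
        prev = curr
    return best


print(smith_waterman_score("TTACGTTT", "GGACGGG"))
```

With these assumed scores, the shared substring `ACG` contributes three matches, so the call above yields a score of 6.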