Spark NLP 5.5.0: Launching Llama.cpp Integration, Llama3, QWEN2, Phi-3, StarCoder2, MiniCPM, NLLB, Nomic, Snowflake, MxBai, more ONNX and OpenVINO integrations, more than 50,000 new models, and many more!
Spark NLP 5.5.0: Unlocking New Horizons with Llama.cpp Integration and More!
We're thrilled to announce the release of Spark NLP 5.5.0, a groundbreaking update that pushes the boundaries of natural language processing! This release is packed with exciting new features, optimizations, and integrations that will transform your NLP workflows. At the heart of this update is our game-changing integration with Llama.cpp, but that's just the beginning of what's in store!
Spotlight Feature: Llama.cpp Integration
Introducing Llama.cpp Integration: A New Era of Efficient Language Models!
We're proud to present the centerpiece of Spark NLP 5.5.0: the integration of Llama.cpp! This revolutionary addition brings unparalleled efficiency and performance to large language models within the Spark NLP ecosystem.
- Optimized Performance: Llama.cpp's C/C++ implementation allows for blazing-fast inference on CPUs, making large language models more accessible than ever.
- Reduced Memory Footprint: Enjoy the power of advanced language models with significantly lower RAM requirements.
- Quantization Support: Take advantage of various quantization options to further optimize model size and speed without sacrificing quality.
- Seamless Integration: Easily incorporate Llama.cpp models into your existing Spark NLP pipelines with our new `AutoGGUFModel` annotator.
This integration opens up new possibilities for deploying state-of-the-art language models in resource-constrained environments, making advanced NLP capabilities available to a wider range of applications and users.
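To see what this looks like in practice, here is a minimal PySpark sketch of an `AutoGGUFModel` pipeline. It assumes the default pretrained GGUF model and a `setNPredict` parameter for capping the number of generated tokens; check the annotator documentation for the exact pretrained names and generation options.

```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import AutoGGUFModel
from pyspark.ml import Pipeline

spark = sparknlp.start()

# Turn raw text into DOCUMENT annotations
document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

# Load a GGUF model and run llama.cpp inference on CPU
# (the default pretrained model and setNPredict are assumptions)
auto_gguf = AutoGGUFModel.pretrained() \
    .setInputCols(["document"]) \
    .setOutputCol("completions") \
    .setNPredict(64)

pipeline = Pipeline(stages=[document_assembler, auto_gguf])

data = spark.createDataFrame([["Explain Apache Spark in one sentence."]]).toDF("text")
pipeline.fit(data).transform(data).select("completions.result").show(truncate=False)
```

Because the model runs through Llama.cpp, the same pipeline works on CPU-only clusters, and quantized GGUF files keep the memory footprint small.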
We extend our heartfelt thanks to all contributors who made this release possible. Your innovative ideas, code contributions, and feedback continue to drive Spark NLP forward. Our Models Hub now contains more than 83,000 free and truly open-source models & pipelines.
New Features & Enhancements
Introducing QWEN2Transformer
We have added the `QWEN2Transformer` annotator, supporting the Qwen-2 model architecture, known for its efficiency and performance in NLP tasks such as text generation and summarization.
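As a quick orientation, the sketch below shows how a generative annotator like `QWEN2Transformer` slots into a pipeline; the default pretrained model and the `setMaxOutputLength`/`setDoSample` parameters are assumptions modeled on Spark NLP's other generative annotators, so verify them against the API reference. The MiniCPM, Phi-3, and LLAMA 3 annotators introduced below follow the same pattern.

```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import QWEN2Transformer
from pyspark.ml import Pipeline

spark = sparknlp.start()

document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

# Generate a continuation for each input document
# (pretrained default and parameter names are assumptions)
qwen2 = QWEN2Transformer.pretrained() \
    .setInputCols(["document"]) \
    .setOutputCol("generation") \
    .setMaxOutputLength(50) \
    .setDoSample(False)

pipeline = Pipeline(stages=[document_assembler, qwen2])
data = spark.createDataFrame([["Summarize: Spark NLP runs on top of Apache Spark."]]).toDF("text")
pipeline.fit(data).transform(data).select("generation.result").show(truncate=False)
```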
Introducing MiniCPM
The `MiniCPM` annotator is now available, providing support for the MiniCPM model, designed for efficient language modeling with smaller parameter counts without compromising performance.
Introducing NLLB (No Language Left Behind)
We are excited to include the `NLLB` annotator, supporting No Language Left Behind models aimed at providing high-quality machine translation for a wide range of languages, especially low-resource languages.
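For orientation, here is a hedged sketch of a translation pipeline. The class name `NLLBTransformer`, the `setSrcLang`/`setTgtLang` setters, and the FLORES-style language codes are assumptions based on Spark NLP's other translation annotators; consult the documentation for the exact API.

```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import NLLBTransformer  # class name is an assumption
from pyspark.ml import Pipeline

spark = sparknlp.start()

document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

# Translate English to French (language codes and setters are assumptions)
nllb = NLLBTransformer.pretrained() \
    .setInputCols(["document"]) \
    .setOutputCol("translation") \
    .setSrcLang("eng_Latn") \
    .setTgtLang("fra_Latn")

pipeline = Pipeline(stages=[document_assembler, nllb])
data = spark.createDataFrame([["Spark NLP makes distributed NLP simple."]]).toDF("text")
pipeline.fit(data).transform(data).select("translation.result").show(truncate=False)
```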
Implementing Nomic Embeddings
Introducing support for Nomic Embeddings, which provide robust semantic representations for downstream tasks like clustering and classification.
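Below is a minimal sketch of computing these embeddings, assuming the annotator is `NomicEmbeddings` and follows Spark NLP's usual sentence-embedding pattern; the MxBai embeddings introduced later in this release can be used the same way.

```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import NomicEmbeddings  # class name assumed
from pyspark.ml import Pipeline

spark = sparknlp.start()

document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

# Produce one embedding vector per document for clustering or classification
embeddings = NomicEmbeddings.pretrained() \
    .setInputCols(["document"]) \
    .setOutputCol("embeddings")

pipeline = Pipeline(stages=[document_assembler, embeddings])
data = spark.createDataFrame([["Nomic embeddings for semantic search."]]).toDF("text")
pipeline.fit(data).transform(data).select("embeddings.embeddings").show(truncate=80)
```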
Snowflake Integration
We have implemented integration with Snowflake, allowing seamless data transfer and processing between Spark NLP and Snowflake data warehouses.
Introducing CamemBertForZeroShotClassification
The `CamemBertForZeroShotClassification` annotator is now available, enabling zero-shot classification using the CamemBERT model, optimized for French language processing.
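Here is a hedged sketch of French zero-shot classification. The document-plus-token inputs and the `setCandidateLabels` parameter follow the pattern of Spark NLP's other `...ForZeroShotClassification` annotators, so double-check them against the documentation; the `AlbertForZeroShotClassification` annotator introduced below works the same way.

```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, CamemBertForZeroShotClassification
from pyspark.ml import Pipeline

spark = sparknlp.start()

document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

tokenizer = Tokenizer() \
    .setInputCols(["document"]) \
    .setOutputCol("token")

# Score each document against labels never seen at training time
# (candidate labels are provided at inference; parameter name assumed)
zero_shot = CamemBertForZeroShotClassification.pretrained() \
    .setInputCols(["document", "token"]) \
    .setOutputCol("class") \
    .setCandidateLabels(["sport", "politique", "technologie"])

pipeline = Pipeline(stages=[document_assembler, tokenizer, zero_shot])
data = spark.createDataFrame([["Le nouveau smartphone sera lancé le mois prochain."]]).toDF("text")
pipeline.fit(data).transform(data).select("class.result").show(truncate=False)
```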
Implementing MxBai Embeddings
We have added support for `MxBaiEmbeddings`, providing embeddings from the MxBai model designed for multilingual text representation.
ONNX Support for Vision Annotators
We have extended ONNX support to our vision annotators, allowing for optimized and accelerated inference on image-related tasks.
OpenVINO and ONNX Support for Additional Annotators
Building upon our commitment to performance optimization, we have added OpenVINO and ONNX support to several additional annotators, ensuring you can leverage hardware acceleration across a broader range of models.
Introducing AlbertForZeroShotClassification
We are excited to introduce the `AlbertForZeroShotClassification` annotator, bringing zero-shot classification capabilities using the ALBERT model, known for its parameter efficiency and strong performance.
Introducing Phi-3
We have integrated Phi-3 models into Spark NLP, delivering strong performance with high-efficiency INT4 and INT8 quantization for CPUs via OpenVINO.
Introducing StarCoder2 for Causal Language Modeling
The `StarCoder2` model is now supported for causal language modeling tasks, enabling advanced code generation and code understanding.
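As a sketch, StarCoder2 can be driven like the other generative annotators above, with a code fragment as the prompt; the class name `StarCoder2Transformer` and the generation parameters are assumptions to verify against the documentation.

```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import StarCoder2Transformer  # class name assumed
from pyspark.ml import Pipeline

spark = sparknlp.start()

document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

# Complete a code prompt (parameter names assumed from other generative annotators)
starcoder2 = StarCoder2Transformer.pretrained() \
    .setInputCols(["document"]) \
    .setOutputCol("generation") \
    .setMaxOutputLength(60)

pipeline = Pipeline(stages=[document_assembler, starcoder2])
data = spark.createDataFrame([["def fibonacci(n):"]]).toDF("text")
pipeline.fit(data).transform(data).select("generation.result").show(truncate=False)
```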
Introducing LLAMA 3
Continuing our coverage of the latest language models, we have added support for LLAMA 3, bringing the newest advancements in the LLaMA model series to Spark NLP.
Bug Fixes
- OpenVINO Installation Instructions: Updated the installation instructions for OpenVINO to ensure a smoother setup process.
- Fixed Default Auto GGUF Pretrained Model: Addressed issues with the default auto GGUF pretrained model in the Llama.cpp integration.
- Updated Models Hub: Improved the Models Hub for better accessibility and search functionality.
- Artifact Creation Optimization: Switched from `vimtor/action-zip` to 7zip for creating artifacts, to enhance compatibility and performance.
Dependencies
- Published New OpenVINO Artifacts: Built and published new OpenVINO artifacts for both CPU and GPU to enhance performance and compatibility.
- Upgraded ONNX Runtime: Updated `onnxruntime` to the latest version for improved stability and performance on both CPU and GPU.
Models
We have added more than 50,000 new models and pipelines. The complete list of all 83,000+ models & pipelines in 230+ languages is available on our Models Hub.
Community support
- Slack: live discussion with the Spark NLP community and the team
- GitHub: bug reports, feature requests, and contributions
- Discussions: engage with other community members, share ideas, and show off how you use Spark NLP!
- Medium: Spark NLP articles on the JohnSnowLabs official Medium
- YouTube: Spark NLP video tutorials
Installation
Python
```bash
# PyPI
pip install spark-nlp==5.5.0
```
Spark Packages
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, 3.3.x, and 3.4.x (Scala 2.12):

```bash
spark-shell --packages com.johnsnowlabs.nlp:spark-nlp_2.12:5.5.0

pyspark --packages com.johnsnowlabs.nlp:spark-nlp_2.12:5.5.0
```

GPU

```bash
spark-shell --packages com.johnsnowlabs.nlp:spark-nlp-gpu_2.12:5.5.0

pyspark --packages com.johnsnowlabs.nlp:spark-nlp-gpu_2.12:5.5.0
```

Apple Silicon (M1 & M2)

```bash
spark-shell --packages com.johnsnowlabs.nlp:spark-nlp-silicon_2.12:5.5.0

pyspark --packages com.johnsnowlabs.nlp:spark-nlp-silicon_2.12:5.5.0
```

AArch64

```bash
spark-shell --packages com.johnsnowlabs.nlp:spark-nlp-aarch64_2.12:5.5.0

pyspark --packages com.johnsnowlabs.nlp:spark-nlp-aarch64_2.12:5.5.0
```
Maven
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, 3.3.x, and 3.4.x:
```xml
<dependency>
    <groupId>com.johnsnowlabs.nlp</groupId>
    <artifactId>spark-nlp_2.12</artifactId>
    <version>5.5.0</version>
</dependency>
```

spark-nlp-gpu:

```xml
<dependency>
    <groupId>com.johnsnowlabs.nlp</groupId>
    <artifactId>spark-nlp-gpu_2.12</artifactId>
    <version>5.5.0</version>
</dependency>
```

spark-nlp-silicon:

```xml
<dependency>
    <groupId>com.johnsnowlabs.nlp</groupId>
    <artifactId>spark-nlp-silicon_2.12</artifactId>
    <version>5.5.0</version>
</dependency>
```

spark-nlp-aarch64:

```xml
<dependency>
    <groupId>com.johnsnowlabs.nlp</groupId>
    <artifactId>spark-nlp-aarch64_2.12</artifactId>
    <version>5.5.0</version>
</dependency>
```
FAT JARs
- CPU on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x/3.4.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-assembly-5.5.0.jar
- GPU on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x/3.4.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-gpu-assembly-5.5.0.jar
- M1 on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x/3.4.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-silicon-assembly-5.5.0.jar
- AArch64 on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x/3.4.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-aarch64-assembly-5.5.0.jar
What's Changed
- SparkNLP 997 Introducing QWEN2Transformer by @prabod in #14188
- SparkNLP 1004 - Introducing MiniCPM by @prabod in #14205
- SparkNLP 1018 - Introducing NLLB by @prabod in #14209
- SparkNLP 1005 implement nomic embeddings by @prabod in #14217
- implementing SnowFlake by @ahmedlone127 in #14353
- Introducing CamemBertForZeroShotClassification annotator by @danilojsl in #14354
- Implementing Mxbai Embeddings by @ahmedlone127 in #14355
- Introducing onnx support to vision annotators by @ahmedlone127 in #14356
- Introducing onnx and OpenVino support to Missing Annotators by @ahmedlone127 in #14359
- [SPARKNLP-855] Introducing AlbertForZeroShotClassification by @danilojsl in #14361
- SparkNLP introducing Phi-3 by @prabod in #14373
- OpenVINO install instructions by @DevinTDHa in #14382
- SPARKNLP 1034 implement starcoder2 for causal lm by @prabod in #14358
- SPARKNLP Introducing LLAMA 3 by @prabod in #14379
- 550 rc export notebooks by @prabod in #14393
- [SPARKNLP-1027] llama.cpp integration by @DevinTDHa in #14364
- Models hub by @maziyarpanahi in #14383
- Update create_search_index.yml by @pabla in #14406
- Adding openvino support to missing annotators by @ahmedlone127 in #14390
- [SPARKNLP-1027] Change Default AutoGGUF pretrained model by @DevinTDHa in #14411
- [SPARKNLP-1027] Fix issue with pretrained model by @DevinTDHa in #14413
- Use 7zip instead of vimtor/action-zip for creating artifacts by @pabla in #14414
- Models hub by @maziyarpanahi in #14416
- release/550-release-candidate by @maziyarpanahi in #14389
Full Changelog: 5.4.2...5.5.0