Draft:Apache Pulsar

From Wikipedia, the free encyclopedia
  • Comment: There is not enough independent, significant coverage here. Please read the advice. WikiOriginal-9 (talk) 17:44, 8 November 2023 (UTC)
  • Comment: So many unreferenced sections, also promotional. Vanderwaalforces (talk) 12:56, 8 November 2023 (UTC)

Apache Pulsar.[1]
Original author(s): Yahoo
Developer(s): Apache Software Foundation
Initial release: September 2016[2]
Stable release: 3.2.2[3] / 2 April 2024
Repository: https://github.com/apache/pulsar
Written in: Java
Operating system: Cross-platform
Type: Stream processing, message broker
License: Apache License 2.0

Apache Pulsar is a cloud-native, distributed messaging and stream-processing platform.[4][5][6]

It provides messaging and stream processing designed to be scalable, durable, and secure, and it targets high-throughput use cases such as event-driven architecture, real-time analytics, and data integration.[7][8]

Pulsar covers both message queue and publish-subscribe (pub-sub) use cases.[9][10][6]
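One way to picture how a single topic can serve both patterns is through named subscriptions: each subscription receives every message (publish-subscribe), while consumers attached to the same subscription divide the messages among themselves (queue-style work sharing). The sketch below is a toy in-memory model written for illustration only. It is not the Pulsar client API, and the class and method names (ToyTopic, subscribe, publish) are hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Toy in-memory model, NOT the Pulsar client API. All names are hypothetical.
class ToyTopic {
    // Subscription name -> inboxes of the consumers attached to that subscription.
    private final Map<String, List<List<String>>> subscriptions = new LinkedHashMap<>();
    // Subscription name -> count of messages delivered so far (drives round-robin).
    private final Map<String, Integer> delivered = new LinkedHashMap<>();

    // Attach a new consumer to the named subscription and return its inbox.
    public List<String> subscribe(String subscriptionName) {
        List<String> inbox = new ArrayList<>();
        subscriptions.computeIfAbsent(subscriptionName, k -> new ArrayList<>()).add(inbox);
        delivered.putIfAbsent(subscriptionName, 0);
        return inbox;
    }

    // Every subscription gets the message once (pub-sub); inside a subscription,
    // consumers take turns round-robin (queue-style work sharing).
    public void publish(String message) {
        for (Map.Entry<String, List<List<String>>> entry : subscriptions.entrySet()) {
            List<List<String>> consumers = entry.getValue();
            int count = delivered.get(entry.getKey());
            consumers.get(count % consumers.size()).add(message);
            delivered.put(entry.getKey(), count + 1);
        }
    }

    public static void main(String[] args) {
        ToyTopic topic = new ToyTopic();
        List<String> workerA = topic.subscribe("workers"); // same subscription:
        List<String> workerB = topic.subscribe("workers"); // messages are split
        List<String> audit = topic.subscribe("audit");     // own subscription: full copy
        for (String m : List.of("m1", "m2", "m3", "m4")) {
            topic.publish(m);
        }
        System.out.println("workerA=" + workerA + " workerB=" + workerB + " audit=" + audit);
    }
}
```

In this model the two "workers" consumers each receive half of the messages, while the "audit" subscription receives all four. A real deployment would use one of the client libraries rather than anything resembling this class.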

Pulsar is an open-source project of the Apache Software Foundation and is written in Java.

There are client libraries for several programming languages, including Java, C++, Go, .NET, Node.js, Python, and Rust.

History

Apache Pulsar was developed by Yahoo! in 2014 and was first released as an open-source project in 2016.[11] In September 2018, it became a top-level project of the Apache Software Foundation.[12][13][14]

Architecture

Apache Pulsar adopts a segmented architecture that separates the serving layer from the storage layer.[15] This separation allows for independent scaling of processing and storage, providing flexibility and better resource utilization.[6][16]

A Pulsar cluster consists of the following components:

  • Brokers: Stateless components that handle client connections and manage the dispatch of messages to consumers.[17]
  • Bookies: A bookie is a node of Apache BookKeeper, a scalable, fault-tolerant, and low-latency storage service optimized for append-only workloads. Bookies are responsible for storing messages on disk.
  • Distributed metadata store: Used for coordination and metadata storage among different Pulsar components.
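The practical effect of keeping brokers stateless can be illustrated with a small sketch: because message data lives in the storage layer, a broker can be replaced without losing anything already published. The code below is a toy model under that assumption, not Pulsar or BookKeeper code. The names ToyCluster, ToyBookie, and ToyBroker are hypothetical, and details such as replication across several bookies and the metadata store are left out.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model of separating serving from storage. NOT Pulsar or BookKeeper code;
// all class and method names are hypothetical.
class ToyCluster {
    // Storage layer: an append-only log standing in for a bookie.
    static class ToyBookie {
        private final List<String> log = new ArrayList<>();

        // Append an entry and return its position in the log.
        long append(String entry) {
            log.add(entry);
            return log.size() - 1;
        }

        String read(long entryId) {
            return log.get((int) entryId);
        }

        long size() {
            return log.size();
        }
    }

    // Serving layer: holds no message data, only a handle to the storage layer.
    static class ToyBroker {
        private final ToyBookie storage;

        ToyBroker(ToyBookie storage) {
            this.storage = storage;
        }

        long publish(String message) {
            return storage.append(message);
        }

        // Read every stored message from startId onward, in order.
        List<String> dispatchFrom(long startId) {
            List<String> out = new ArrayList<>();
            for (long id = startId; id < storage.size(); id++) {
                out.add(storage.read(id));
            }
            return out;
        }
    }

    public static void main(String[] args) {
        ToyBookie bookie = new ToyBookie();
        ToyBroker first = new ToyBroker(bookie);
        first.publish("a");
        first.publish("b");

        // Swap in a new broker: earlier messages survive because they live in storage.
        ToyBroker replacement = new ToyBroker(bookie);
        replacement.publish("c");
        System.out.println(replacement.dispatchFrom(0));
    }
}
```

The replacement broker can serve messages "a" and "b" even though it never handled them, which is the property that lets the serving and storage layers be scaled independently.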

Multiple Pulsar clusters can be connected to form a single Pulsar instance. This setup is often used to geo-replicate data across multiple data centers.

External Links

See Also

References

  1. "Apache Pulsar at GitHub". github.com. Retrieved 7 November 2023.
  2. "Open-sourcing Pulsar, Pub-sub Messaging at Scale". 2019. Retrieved 7 November 2023.
  3. "Release 3.2.2". 2 April 2024. Retrieved 19 April 2024.
  4. "What is Apache Pulsar?". 2020. Retrieved 9 November 2023.
  5. "Introduction to Apache Pulsar". 2023. Retrieved 9 November 2023.
  6. Joseph, Jowanza (2022-01-18). Mastering Apache Pulsar. O'Reilly Media. ISBN 978-1-4920-8490-7.
  7. "Apache Pulsar and its enterprise use cases" (PDF). 2018. Retrieved 9 November 2023.
  8. "Using Apache Pulsar for Data-Driven Applications". 2021. Retrieved 9 November 2023.
  9. "Exploring Apache Pulsar Use Cases". 2021. Retrieved 8 November 2023.
  10. "How Apache Pulsar is Helping Iterable Scale its Customer Engagement Platform". 2020. Retrieved 9 November 2023.
  11. "Open-sourcing Pulsar, Pub-sub Messaging at Scale". 2019. Retrieved 7 November 2023.
  12. "The Apache Software Foundation announces Apache® Pulsar™ as a top-level project". September 25, 2018.
  13. "THE APACHE SOFTWARE FOUNDATION ANNOUNCES APACHE® PULSAR™ AS A TOP-LEVEL PROJECT". September 25, 2018.
  14. "Pulsar graduates to being an Apache top-level project". ZDNet. September 27, 2018.
  15. "A High-Performance Messaging Platform, with Intel® Optane™ Persistent Memory" (PDF). 2022. Retrieved 8 November 2023.
  16. "Comparing Pulsar and Kafka: How a Segment-Based Architecture Delivers Better Performance, Scalability, and Resilience". 2017. Retrieved 8 November 2023.
  17. Kjerrumgaard, David (2021-12-14). Apache Pulsar in Action. Shelter Island: Simon and Schuster. ISBN 978-1-61729-688-8.