InfoQ Homepage Database Content on InfoQ
-
Optimizing Search Systems: Balancing Speed, Relevance, and Scalability
Innovative software engineer focused on optimizing search performance in dynamic environments. This article highlights key strategies from our QCon San Francisco 2024 presentation, addressing challenges faced by platforms like Uber Eats in data indexing and retrieval. Our advancements ensure swift, relevant user experiences amidst ever-growing datasets.
-
Shadow Table Strategy for Seamless Service Extractions and Data Migrations
The shadow table strategy creates a synchronized duplicate of the data that keeps the production system fully operational during changes, enabling zero-downtime migrations. The approach supports diverse scenarios - including database migrations, microservices extractions, and incremental schema refactoring - that update live systems safely and progressively.
-
Bridging Modalities: Multimodal RAG for Advanced Information Retrieval
In this article, the authors discuss how multi-model retrieval augmented generation (RAG) techniques can enhance AI by integrating multiple modalities like text, images, and audio for deeper contextual understanding, with help of a practical example of a healthcare application.
-
How to Compute without Looking: a Sneak Peek into Secure Multi-Party Computation
This article shows how you can compute a function across multiple parties that do not trust each other without forcing them to share their individual inputs. This technique can be used to split secrets among parties, perform logical operations, or count votes in a way that ensures data privacy is preserved.
-
Reactive Real-Time Notifications with SSE, Spring Boot, and Redis Pub/Sub
Explore the power of reactive programming for building scalable real-time notification systems. Using Spring Boot Reactive and Spring WebFlux, leverage non-blocking operations to handle high-volume, asynchronous data flows efficiently. Discover how Redis Pub/Sub enables event-driven messaging and how the SSE protocol provides persistent connections for instant client updates without polling.
-
Optimizing Wellhub Autocomplete Service Latency: a Multi-Region Architecture
Every company wants fast, reliable, and low-latency services. Achieving these goals requires significant investment and effort. In this article, I will share how Wellhub invested in a multi-region architecture to achieve a low-latency autocomplete service.
-
Modernizing Testing Practices for Jakarta EE Projects
This article focuses on the increasing adoption of data-driven testing in Java enterprise applications and sheds light on the Data and NoSQL Jakarta specifications. It highlights the significance of modern testing libraries such as JUnit Jupiter and AssertJ and emphasizes the importance of container-based frameworks like Testcontainers in enhancing testing practices.
-
Managing 238M Memberships at Netflix
In this article Surabhi Diwan shared how the Netflix membership team does distributed systems: the architecture bets, technology choices, and operational semantics that serve the needs of Netflix’s ever-growing member base.
-
Relational Data at the Edge: How Cloudflare Operates Distributed PostgreSQL Clusters
Explore Cloudflare's distributed PostgreSQL clusters and learn how a cross-region architecture ensures resilience. Discover how data storage and access at the edge deliver massive performance gains by reducing location-sensitive latency and why architecting for degraded states is much harder than for failure states.
-
Zero-Knowledge Proofs for the Layman
This article will introduce you to zero-knowledge proofs, a kind of cryptography you can use to provide the proof you know a secret, such as a private key or the solution to a problem, without ever sharing it to an interested party. While many articles exist on the topic, this will not require any high math knowledge.
-
The Hidden Cost of Using Managed Databases
The rising popularity of managed relational databases brings hidden costs, and informed decisions are crucial for optimal use. This article shows the importance of monitoring service expenses, revising default settings, and understanding operational constraints, considering limitations like reduced flexibility and observability.
-
Understanding Architectures for Multi-Region Data Residency
This article focuses on implementing data residency strategies for a positive stakeholder experience. It underscores the need to diversify data locations, driven by motivations like disaster recovery and geo-redundancy. The core principle is data distribution, ensuring specific sets reside in distinct regions without overlap - a practice termed data residency.