data architecture framework
Articles Data Engineering Data Pipelines Data Structuring DataOps/Analytics Real time data

VIDEO: How Data Management capabilities fit together in an effective data architecture framework

0

A data architecture, in simple terms, is a framework for the IT infrastructure to be able to support the data strategy. It refers to the models, rules through which the data is collected, arranged, stored, transported, and utilized in an organization. Data capabilities include metadata management, master/reference data management, data

Apache Kafka Articles Data Pipeline Data Pipelines Data Structuring Kafka Kafka Architecture Real Time Streaming

Start Automating your Data pipelines with Apache Airflow

0

About Apache Kafka & Apache Airflow Kafka is a popular messaging platform based on pub-sub mechanism. It is a highly available, fault tolerant & distributed system. Most organisations are using Kafka in different use-cases, but the best part of Kafka is it’s service oriented architecture, it makes it language agnostic,

Machine learning with Kafka
AI/ML Articles Artificial Intelligence Data Engineering Data Pipelines Kafka Kafka Architecture Kafka Streams

Machine Learning with Kafka

0

Machine learning with Kafka - The modern Stream Processing  platform for all data preprocessing needs Some bits about Kafka Kafka is a popular messaging platform based on pub-sub mechanism. It is a highly available, fault tolerant & distributed system. Most organisations are using Kafka in different use-cases, but the best

Big Data Analytics at MoPub
Analytics Tools Articles Case Studies Data Pipelines Data Structuring DataOps/Analytics Real time data

CASE STUDY: MoPub Querying Terabytes of Data in Seconds using Big data analytics

0

MoPub provides monetization solutions for mobile app publishers and developers globally. The company platforms to drive maximum revenue for every ad impression and control the user experiences. It has over 1.7 billion monthly unique devices, 1 trillion ad requests, 52000 plus apps, and more than 180 demand-side partners on its

streaming data pipeline in python
Articles Data Pipeline Data Pipelines Event Streaming Real Time Streaming

Creating your own streaming data streams in python

0

Kafka is a popular messaging platform based on pub-sub mechanism. It is a highly available, fault tolerant & distributed system. Most organisations are using Kafka in different use-cases, but the best part of Kafka is it’s service oriented architecture, it makes it language agnostic, and gives it wide usability. One

Data pipeline on cloud
Articles Case Studies Cloud Cloud Computing Confluent Kafka Data Engineering Data Pipeline Data Pipelines Event Streaming Kafka Real time data streaming Real Time Streaming

CASE STUDY: Drug discovery with new data pipelines based on Confluent Cloud

0

Recursion is a biotechnology company founded in 2013, headquartered in Salt Lake City, Utah. It accelerates drug discovery by combining experimental biology, artificial intelligence, automation, and real-time event streaming. It has built a system that processes over three petabytes of biological image data generated on Recursion’s robotic platform. The company

Rwal time data analytics using AWS Kinesis
Apache Kafka Apache Nifi Articles Data Engineering Data Ingestion Tools Data Modeling Data Pipelines Data Structuring Real time data

Enterprise Data Streaming Architecture

0

A modern enterprise Data Streaming Architecture will have a lot of communications, these can include application queues, streaming data, control data, data transfers, for backup, syncing, updates, IOT sensor data, system snapshots and other never ending use cases. We would be discussing about some commonly used use-cases in most enterprises.

Big data analytics using Kafka
Apache Kafka Articles Data Pipelines Kafka Architecture Kafka Use Cases Real time data streaming Real Time Streaming

Real-Time Streaming and Data Pipelines with Apache Kafka

0

Every large corporation and SME have found real-time data analysis quite critical. Many industries such as legal services, financial services and IT operation management are in need of massive real-time data along with historical data. When we are in need of handling high volume data, we should implement the best

How To Build a Scalable Big Data Pipeline
Articles Data Engineering Data Pipeline Data Pipelines

How To Build a Scalable Big Data Pipeline

0

When you deploy machine learning, big-data analytics, and data science in real-time, you need to remember that model training and analytics tuning occupies only a portion of the work. Around 50% of the effort is dependent upon grooming the data for Machine Learning and Analytics. The rest of the effort

Data Pipeline definition
Articles Data Pipeline Data Pipelines ETL Pipeline Real time data

What is a Data Pipeline? Definition & Examples

0

Have you ever star-gazed? Let’s imagine that you are counting the number of stars in the sky. Would you be able to count all the stars? You can categorize them for sure. That’s exactly how abundant data is nowadays. When you allow data flow from one location to the next,

Real time data streaming and Analytics
Articles Case Studies Data Pipeline Data Pipelines Real time data Real time data streaming Real Time Streaming

CASE STUDY: Real time analytics & Data management at Charter

0

Charter Communications, Inc. is a leading broadband connectivity company and cable operator. Through their brand, Spectrum, they offer a full range of state-of-the-art residential and business services including Spectrum Internet®, TV, Mobile, and Voice in 41 states for more than 30 million customers. As customers always require better reliability, competitive

Real time Data Streaming
Apache Kafka Apache Nifi Articles Data Ingestion Tools Data Pipeline Data Pipelines Kafka Architecture Kafka as a Service Kafka Use Cases

Data Ingestion Pipelines & use cases

0

What is a Data pipeline? A data pipeline is a system where data is transferred in chunks in a serial and systematic manner (Messages, records) between systems. These flows are well defined, audited and might contain sensitive information, which needs to be secured.  These pipelines can be application queues, transfers