Data Analysis Pipeline for Cloud Security

Architecture Overview

This document outlines the design and implementation of a data analysis pipeline for cloud security. The pipeline monitors and analyzes user access to a web application hosted in a Google Cloud Platform (GCP) Virtual Private Cloud (VPC).

[Figure: data-pipeline.png]

Key Components:

  1. Firewall: Controls access to the VPC.
  2. GCP VPC: Contains the subnet where the web application is hosted.
  3. Logs Explorer (Cloud Logging): Serves as the primary collection point for firewall logs.
  4. Log Router: Routes the collected logs to appropriate storage and processing services.
  5. BigQuery: Provides a platform for log analysis and user monitoring.
  6. Looker: Used for visualization and reporting based on BigQuery datasets.
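
The routing step in the component list (firewall logs passing through the Log Router into BigQuery) can be sketched as a small configuration helper. This is a minimal illustration, not the project's actual setup: the class name FirewallLogSink and the values "my-project" and "firewall_logs" are hypothetical placeholders. It assumes Firewall Rules Logging is enabled, in which case entries are written to the compute.googleapis.com/firewall log under the gce_subnetwork resource type.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FirewallLogSink:
    """Hypothetical description of a Log Router sink for firewall logs."""

    project_id: str  # assumed GCP project ID (placeholder, not from the source)
    dataset_id: str  # assumed BigQuery dataset name (placeholder)

    @property
    def log_filter(self) -> str:
        # Select only firewall log entries; the slash in the log ID
        # is URL-encoded as %2F inside the log name.
        return (
            'resource.type="gce_subnetwork" AND '
            f'logName="projects/{self.project_id}/logs/'
            'compute.googleapis.com%2Ffirewall"'
        )

    @property
    def destination(self) -> str:
        # Destination string format for routing logs to a BigQuery dataset.
        return (
            "bigquery.googleapis.com/"
            f"projects/{self.project_id}/datasets/{self.dataset_id}"
        )


sink = FirewallLogSink(project_id="my-project", dataset_id="firewall_logs")
print(sink.log_filter)
print(sink.destination)
```

The two strings are the inputs a sink definition needs: the filter decides which entries the Log Router forwards, and the destination names the BigQuery dataset that receives them. They could be passed, for example, to `gcloud logging sinks create` through its destination argument and `--log-filter` flag.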

Pipeline Workflow

1. User Access Through Firewall

2. Logging User Activity