2022

25 Nov – TheDataGuy Migrated From Jekyll To Hugo

2021

28 Aug – Debezium With AWS MSK IAM Authentication

03 Aug – Integrate Debezium And Sink Connectors With AWS Glue Schema Registry

20 Jul – Integrate Debezium with AWS Secret Manager For Retrieving Passwords

06 Apr – Airflow PostgreSQL to BigQuery Dynamically Generate DAGs and Tasks

09 Mar – Use AWS Glue DataBrew To Remove Any Sensitive And PII Information

19 Feb – Migration Journey From GreenPlum To BigQuery

2020

11 Nov – Migrate Greenplum Partitions To BigQuery With Airflow/Python

07 Nov – Airflow BigQueryHook And Operator To Create Range Partition

02 Nov – Airflow Custom PostgreSQL To Google Cloud Storage Operator

30 Oct – Migrate 5TB ElasticSearch Cluster In GCP With Disk Snapshot

13 Oct – CloudSQL Shared VPC Private IP And servicenetworking.serviceAgent role

28 Sep – Access CloudSQL PostgreSQL FDW From Different VPC Or Public

06 Sep – RedShift COPY Command From SCT Agent - Multiple Tables

31 Aug – Backup RedShift System Tables Incrementally

16 Aug – Backup RedShift Objects To GitHub With Lambda

01 Aug – BigQuery Data Pipeline Without Any Orchestrator Just CloudFunction And PubSub

23 Jul – GCP Convert StackDriver Log Sink As Hive Partition In GCS

18 Jul – AWS RDS pg_dump ERROR LOCK TABLE IN ACCESS SHARE MODE for rds_superuser

09 Jul – How To Restore MySQL Users And Passwords During Migration

19 Jun – Restore AWS ElasticSearch Snapshot Failed - Index settings

12 Jun – Why You Should Not Compress RedShift Sort Key Column

11 Jun – Find Ghost Rows/Dead Rows For All The Tables In RedShift

08 Jun – Why RedShift is showing rows_pre_user_filter is zero

29 May – RedShift Dynamic WLM With Lambda

28 May – RedShift User Activity Log In Spectrum With Glue Grok

20 May – RedShift Tombstone Blocks a visual explanation

13 May – Disk Based Queries - A Hidden Monster in RedShift

26 Apr – Redshift Copy Script From Sct Agent Export S3 Part1

14 Apr – Automate RedShift Vacuum And Analyze with Script

06 Apr – Export RedShift System Tables And Views To S3

05 Apr – Analyze RedShift user activity logs With Athena

25 Mar – Reconstruct RedShift STL_QUERYTEXT using AWS Athena

20 Mar – 3 Things Before Migrating To CloudSQL(MySQL)

18 Mar – RedShift Reconstructing SQL from STL_QUERYTEXT

07 Mar – Experimenting AWS RedShift Column Level ACL

04 Feb – Internals of Google Cloud Spanner

21 Jan – Debezium MySQL Snapshot For CloudSQL(MySQL) From Replica

19 Jan – Setup Multi Data Center Neo4j Cluster In AWS and GCP

19 Jan – Setup Neo4j Causal Cluster On GCP And AWS

02 Jan – Debezium MySQL Snapshot For AWS RDS Aurora From Backup Snapshot

2019

31 Dec – Debezium MySQL Snapshot From Read Replica And Resume From Master

28 Dec – Debezium MySQL Snapshot From Read Replica With GTID

25 Dec – RedShift Kill All Locking Sessions On A Table

24 Dec – Monitor Debezium MySQL Connector With Prometheus And Grafana

19 Dec – Build Production Grade Debezium Cluster With Confluent Kafka

22 Nov – RedShift Unload Like A Pro - Multiple Tables And Schemas

07 Nov – Redshift Stored Procedure Comma separated string in Argument

29 Oct – Where GCP Internal TCP Load Balancer Fails

21 Oct – MySQL Calculate How Much Disk Space You Wasted

17 Oct – BackFill Failed Delivery From Kinesis To RedShift With Lambda

08 Oct – AWS Glue Custom Output File Size And Fixed Number Of Files

07 Oct – RedShift Unload All Tables To S3

02 Oct – How GCP Browser Based SSH Works

01 Oct – CloudWatch Custom Log Filter Alarm For Kinesis Load Failed Event

23 Sep – Relationalize Unstructured Data In AWS Athena with GrokSerDe

27 Aug – RedShift Unload to S3 With Partitions - Stored Procedure Way

24 Aug – MySQL Convert Binlog Based Replication To GTID Replication Without Downtime

20 Aug – MongoDB Add Node To Replica Set Without Initial Sync In GCP/AWS

08 Aug – Database Mirroring is still a Mystery

16 May – Monitor Cassandra Clusters with Percona PMM - JMX Grafana and Prometheus

03 Mar – MySQL PITR The Fastest Way With DevOps

25 Feb – MySQL Exact Row Count For All The Tables

02 Feb – MySQL With DevOps 1 - Automate Database Archive

25 Jan – Create Aurora Read Replica With AWS CLI/Lambda Python

12 Jan – AWS DocumentDB - A NoSQL Equivalent For Aurora

01 Jan – Automation Script For Percona Xtrabackup FULL/Incremental

2018

29 Dec – RunDeck Series 5 - Encrypt Key Files And Passwords In RunDeck

29 Dec – RunDeck Series 4 - Configure RunDeck SMTP With AWS SES

29 Dec – RunDeck Series 3 - Configure Nginx ProxyPass For RunDeck

28 Dec – RunDeck Series 2 - Add Nodes to the Rundeck

12 Oct – RunDeck Series 1 - Install And Configure RunDeck 3.0 On CentOS 7

26 Sep – Archive MySQL Data In Chunks Using Stored Procedure

10 Sep – Internals Of Google DataStore And Technical Overview

23 Aug – MySQL GTID vs MariaDB GTID

20 Jul – How To Convert MySQL Two Digit Year To Four Digit Year

10 Jul – Automate AWS RedShift Snapshot And Restore

30 May – AWS Athena Automatically Create Partition For Between Two Dates

30 May – Automate AWS Athena Create Partition On Daily Basis

15 May – Automatically Add EC2 Instances to Active Directory Domain

06 Mar – How To Migrate PostgreSQL Users To AWS RDS PostgreSQL

02 Mar – Why Windows Server 2016 In AWS Unable To Resolve Public And Local DNS

10 Feb – Automatically Enable CDC In RDS SQL Server

04 Feb – Don't Use AWS AMI To Backup Your EC2 Database Server

2017

31 Dec – How To Restore Corrupted System Databases

28 Dec – Configure Bucardo replication on CentOS

05 Dec – Postgresql Insecure directory in ENV PATH - Unable To Start

04 Nov – How To Change MySQL Default Data, Binlog, Error Log Directories

19 Oct – Compare Two SQL Server Databases using Tsql

19 Oct – How To Install Oracle 12C on Amazon Linux In Silent Mode

19 Oct – What Is AWS Aurora Database Clone

19 Oct – SQL Server All In One Security Audit Script

19 Oct – How To Map Linux Users To Postgres User

19 Oct – SQLServer Backup with dbatools vs Ola Hallengren