All about DataSince, DataEngineering and ComputerScience
View the Project on GitHub datainsightat/DataScience_Examples
A scalable and secure data platfrom that allows enterprises to
any type or volume of information.
Datalake | Datawarehouse |
---|---|
Native format | Loaded after usecase |
All data types | Processed |
Easy Changes | Faster insights |
Applicatoin-specific | Current and historical data |
Consistent schema |
Data can be imported “as is”.
Data is transformed after loading it in the target.
Data needs to be Transformed before loading.
Cloud Storage simulates file system. Object paths are stored as metadata for each object. They are not partitioned in distinct folders.
File Access
gs://declass/de/modules/02/script.sh
Web Access
https://storage.cloud.google.com/declass/de/modules/02/scsript.sh
Transactional workload is write-heavy. Analytical systems are read-heavy.
$ gsutil -m cp ..
Managed Services for RDBMS (SQL Server, MySQL, PostgreSQL).
Backup, revocery, scaling and security is managed.
Fully managed | Serverless |
---|---|
No setup | No server managemend |
Automatd backups | Fully managed security |
Replicated | Pay for usage |