Data Warehousing Glossary
Data warehousing is a complex and varied topic with a significant number of difficult to understand terms. This glossary should provide a high-level overview of some of the more important data warehousing concepts. Read more about Snowflake itself on our architecture page, and try Snowflake for free to see data warehousing in the cloud first hand.
What is an ad-hoc query?
Ad-hoc queries are created when questions arise that are not able to be solved with predetermined or predefined datasets.
What is data encryption?
Data encryption translates data into another form, or code, so that only people with access to a secret key or password can read it. Often, data encryption is automated as part of other processes within a data platform.
What is data analytics?
Data analytics is the process of querying and interrogating data in the pursuit of valuable insight and information.
Data Mart vs Data Warehouse
What is a data mart?
A data mart is a curated subset of data often generated for analytics and business intelligence users. Data marts are often created as a repository of pertinent information for a subgroup of workers or a particular use case.
What is data protection?
Data protection is the process of safeguarding important information from corruption and/or loss.
What is Data Warehousing?
A data warehouse is a relational database that is designed for analytical rather than transactional work. Typically, data warehouses are used by data scientists, line-of-business analysts, business intelligence developers, and others who have a need to analyze data.
JSON: A Semi-Structured data format
JSON is a semi-structured data format. It can be used in a multitude of applications, but it has become more common as a format for data transmission between servers and web applications or web connected devices. This is partially because those applications can often only receive data as text, and JSON is a text based data format.
What is SaaS?
Software as a service (SaaS) is a software distribution model in which a third-party provider hosts applications and makes them available to customers.
What is semi-structured data?
In comparison to flat files like CSVs, which use relational “columns and rows”, semi-structured files store data in a nested format.
What is SQL?
SQL stands for Structured Query Language. It’s a standardized format for querying and managing databases.
What is a Data Vault?
Data Vault modeling provides a method and approach to modeling your enterprise data warehouse (EDW) that is agile, flexible, and scalable.