3 ways to create a starter AI data storage system you’ll love

Getting started with the purpose of your Artificial Intelligence (AI) IoT, ML, NLP journey is almost impossible without data. And for one data scientist or developer resource using their local machine with a simple Jupyter notebook instance with sample data and a basic prototype, on a small project that’s just a hobby. But in order to create enterprise AI/ML solutions for your product or company and then scale you need a data ingestion and egress system that can allow for collaboration, snapshotting, and lineage analysis (aka impact analysis).

However, all of the factors that go into an AI storage system come down to the purpose of your initiative. Is it light weight? is it just one person? Is budget unlimited? Is the project throwaway, etc.? So here’s a three ways to create a starter AI data storage system without going into too much detail that you’ll like but perhaps you grow to love depending on the context and depth of your initiative.

#1 – Cloud Object Storage

#2 – Git (specifically Git Branching)

#3 – Data Lake House

More to explorer

Snowflake Loading Data with Special Characters

July 24, 2024 No Comments

Special characters in your column names can cause chaos for downstream users, tools and processes.

Building a Generative AI Competency (or the First Gen AI Project)

July 21, 2024 No Comments

When Building a Generative AI Competency one must identify the necessary infrastructure, architecture, platform, and other resources and partners that can help an AI initiative be successful. We have just like many data warehouse and digital transformation initiatives over the last 20 years fail because of poor leadership, or companies only going half in on the objective.

clock, time management, time-3222267.jpg

Snowflake Time Travel Not the First Time Traveller but Let’s Review

June 12, 2024 No Comments

IBM DB2 long before Snowflake had this concept as did a few other select databases using a Temporal Database concept. I think Snowflake was able to make it more popular and mainstream due to the association of Data Warehousing and analytics specifically