Please note this is a repost from Altimeter’s LinkedIn. You can find the original post here.
Today we’re very excited to announce our partnership and Series B investment in Tabular, a company building around Apache Iceberg. Tabular is a compelling data lakehouse solution, meaning it brings data warehouse functionality (SQL semantics + ease of use) to the data lake (cost-efficient and scalable).
Great article. Quick question: when Slootman and Nadella talk about the importance of data in AI, how does structured data play into it? Since most ML models use unstructured data for training, are warehouses only used to store the outputs the model comes up with, or are we going to have models that are trained on unstructured data and then given structured data to come up with business insights?
Good to see the continued growth and investment in this sector. Early days for sure, but this will be the default for how data is architected: data lakes, open formats, tabular formats and a choice of SQL engines.