Mark is joined by long-term industry veteran and friend Christian Berg to talk about surviving fifteen years as a contractor in analytics industry, changes he's seen in the market and in how project are approached, the value in getting involved in the community, and in a specially extended Christmas and New Year edition we look back at what was topical in 2017 and what are Christian's predictions for 2018 ... and appoint Christian as Head of our Best Practices Found on the Internet.
Mark Rittman is joined in this episode of Drill to Detail by Dr. Carsten Bange from BARC to talk about findings from the recently completed BI Survey 17 including the continuing move to modern BI platforms and self-service desktop tools, analytics adoption trends and the increasing incorporation of BI functionality within business applications, the surprising topicality of master data management and data governance ... and whatever happened to Nigel Pendse and his legendary OLAP Report?
- The BI Survey 17: The World’s Largest Annual Survey of BI Users
- Master Data and Data Quality Management Now the #1 Trend in BI
- BI Trend Monitor 2018 Infographic: The Evolution of Trends
- The Business Intelligence Industry Continues Its Ongoing Empowerment of Business Users
- The OLAP Report: The origins of today’s OLAP products (c. 2005, from the Internet Archive)
Mark Rittman is joined in this episode by returning special guest Jen Underwood to talk about what's new and innovative in the BI and analytics industry right now, and how AI and machine learning are this year's data discovery and data visualization.
- "Between The Lines At Tableau Conference" - Jen Underwood.com blog
- "Transform The Business With Automated Embedded Artificial Intelligence" - JenUnderwood.com blog
- "Moving From Bi To Machine Learning With Automation" - JenUnderwood.com blog
- "How Smart Data Discovery Will Radically Transform Analytics" - Tellius Webinar with Jen Underwood
- YellowFin BI - homepage
- Paxata - homepage
- "Drill To Detail Ep.8 'Self-Service BI, Data Prep & Big Data Vendor Strategy' With Special Guest Jen Underwood"
Mark is joined in this episode of Drill to Detail by Wes McKinney, to talk about the origins of the Python Pandas open-source package for data analysis and his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
- Python Data Analysis Library
- "Ibis on Impala: Python at Scale for Data Science"
- Drill To Detail Ep.3 'Apache Kudu And Cloudera's Analytic Platform' With Special Guest Mike Percy
- Apache Arrow homepage
- "Apache Arrow and the "10 Things I Hate About pandas"
- "Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?"
- "Some comments to Daniel Abadi's blog about Apache Arrow"
- Wes McKinney homepage
Mark is joined by Mike Durran from the Oracle Analytics Product Management team in this UKOUG Tech’17 special to talk about his route into product management via the Oracle Discoverer BI tool, Oracle’s latest product in this space Oracle Data Visualization Desktop 4 and its new features, and Mike’s upcoming sessions at the UK Oracle User Group’s Tech’17 event next week in Birmingham, UK.
Mark is joined in this episode by Avi Zloof from Evaluex to talk about the new world of elastically-provisioned cloud-hosted analytic databases such as Google BigQuery and Amazon Athena, how their pricing model and vendor strategy differs from the traditional database vendors, and how machine learning can be used to automate performance tuning and optimize workloads in this new world of large-scale distributed query and storage.
Mark is joined in this episode by Google Cloud Platform Developer Advocate Felipe Hoffa, talking about getting started as a developer using Google BigQuery along with Google Cloud Dataflow, Google Cloud Dataprep and Google Cloud Platform's machine learning APIs.
Mark Rittman is joined in this episode by Taylor Brown from Fivetran to talk about middleware for SaaS data, their focus on integrations with SaaS vendors and how this differentiates their offering, his thoughts on packaged analytic applications announced at the recent Looker Join conference ... and where the name "Fivetran" came from.
In this episode Mark is joined by ex-colleague and now Technical Advisor to Gluent, Michael Rainey, to talk about hybrid platforms and Gluent's new cloud offload capability, the Hadoop market in-general and his thoughts on data engineering and the recently-released AWS Glue data integration service.
- Gluent Cloud Sync – Sharing Data to Enable Analytics in the Cloud
- Gluent Case Studies
- Gluent Data Platform Overview
- Amazon Glue
- The Rise of the Data Engineer and The Downfall of the Data Engineer by Maxime Beauchemin
- Drill to Detail Ep.26 'Airflow, Superset & The Rise of the Data Engineer' with Special Guest Maxime Beauchemin
- Drill to Detail Ep.12 'Gluent and the New World of Hybrid Data' with Special Guest Tanel Poder
Drill to Detail returns for a new season with special guest Jean-Pierre Dijcks, to talk about Oracle's Big Data Strategy now and in the past, thoughts on distributed query and storage in the cloud, and previewing themes and announcements to look forward to at the upcoming Oracle Open World 2017 event running in San Francisco next month
Mark Rittman is joined in this Summer Special episode by none other than Cameron Lackpour, Essbase expert and Oracle ACE Director, to talk about why and how Essbase won the OLAP wars, how Essbase Server works and the role it now plays in Oracle Analytics Cloud and his involvement with user groups over the years. In this specially extended edition he also gives us his reading recommendations for while you're at the pool or, as he will be, out camping, and he also shares his predictions for what we'll hear from Oracle and the analytics industry when he, and Drill to Detail, returns in the autumn after a well-deserved summer break.
Mark Rittman is joined by Industry Analyst Mark Madsen to talk about marketing analytics and the rise of the omni-channel consumer, the use of AI in analytics and personalization and what this all means for brands, for advertisers and for marketers.
In this episode Mark is joined by Jake Stein to talk about Stitch Data and their ETL tool for data engineers, the new open-source project Singer and his experiences building a software startup that both partners and competes with the big cloud platform vendors.
- Stitch Data
- Singer: Simple, Composable Open-Source ETL
- Setting the Data Strategy for Your Growing Organization
- The State of Data Engineering
- The State of Data Science
- Why our ETL Tool Doesn't Do Transformations
- Airflow: a workflow management platform
- Goodbye RJMetrics, Hello Fishtown Analytics
- Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department
Mark Rittman is joined by Donald Farmer to talk about his work at Microsoft on SQL Server Analysis Services and Integration Services, why he moved to Qlik and the challenges of evolving a BI product strategy from focusing on desktops to focusing on the enterprise, and some advice for customers, software vendors and partners working with data and analytics tools.
In this episode Mark is joined by Tristan Handy from Fishtown Analytics to talk about building-out analytics functions in high-growth startups, and three related blog posts he wrote on this topic.
Mark is joined by Qubit colleague Will Browne to talk about a recent academic paper co-authored with Mike Swarbrick Jones on conversion optimisation techniques in the eCommerce industry. Using analytics and statistical analysis On 20 billion "user journeys" recorded in Qubit's Google Cloud Platform-hosted Customer Data Store this paper compares techniques using data and machine learning to those based on traditional sales techniques to see whether data trumps emotion ... or both have their place.
Mark is joined by returning special guest Dan McClary to talk about data modeling and database design on distributed query engines such as Google BigQuery, the underlying Dremel technology and columnar storage format that enables this cloud distributed data warehouse-as-a-service platform to scale to petabyte-size tables spanning tens of thousands of servers, and techniques to optimize BigQuery table joins using nested fields, table partitioning and denormalization.
- Dremel: Interactive Analysis of Web-Scale Datasets
- BigQuery under the hood
- Inside Capacitor, BigQuery’s next-generation columnar storage format
- Drill To Detail Ep.2. 'Future Of SQL On Hadoop', With Special Guest Dan McClary
- Google BigQuery, Large Table Joins and How Nested, Repeated Values and the Capacitor Storage Format (and Looker) Saves the Day
Oracle's Jack Berkowitz joins Mark Rittman to talk about a new category of continuously adapting, self-learning applications being built-out by Oracle that use machine learning together with enterprise and third-party data to create a new generation of intelligent HR, CX, SCM and ERP SaaS apps.
Stewart Bryson returns to the show to join Mark Rittman to discuss new-world BI and data warehousing development using Google BigQuery and Amazon Athena, Apache Kafka and StreamSets, and talks about his experiences with Looker, the cloud-native BI tool that brings semantic modeling and modern development practices to the world of business intelligence.
Mark Rittman is joined in this episode by Independent Consultant Adrian Ward to talk about Oracle Business Analytics, Data Visualization, the BI Applications and his new book on Oracle Business Intelligence 12c.