{"id":5229,"date":"2023-03-30T14:40:09","date_gmt":"2023-03-30T18:40:09","guid":{"rendered":"https:\/\/solutionsreview.com\/data-management\/?p=5229"},"modified":"2023-03-31T07:17:33","modified_gmt":"2023-03-31T11:17:33","slug":"apache-arrow-tutorial","status":"publish","type":"post","link":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/","title":{"rendered":"A Brief Apache Arrow Tutorial by Open Data Architecture Expert"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-5244\" src=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg\" alt=\"Apache Arrow Tutorial\" width=\"800\" height=\"400\" srcset=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg 800w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio-300x150.jpg 300w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio-768x384.jpg 768w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio-600x300.jpg 600w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio-162x81.jpg 162w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio-360x180.jpg 360w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/p>\n<p><i><strong>Solutions Review\u2019s Expert Insights Series is a collection of contributed articles written by industry experts in enterprise software categories. In this feature, <a href=\"https:\/\/www.dremio.com\/\" target=\"_blank\" rel=\"noopener\">Dremio<\/a> Developer Advocate Alex Merced offers a brief Apache Arrow tutorial with advice for getting started.<\/strong><\/i><\/p>\n<p style=\"text-align: justify;\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-4899 alignleft\" src=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/01\/oie_6194352a4wFeWoG.png\" alt=\"Expert Insights Badge Small\" width=\"105\" height=\"105\" srcset=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/01\/oie_6194352a4wFeWoG.png 105w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/01\/oie_6194352a4wFeWoG-81x81.png 81w\" sizes=\"(max-width: 105px) 100vw, 105px\" \/>The open-source Apache Arrow project has been transforming the data landscape for the better since its release in 2016, helping to solve the challenges of moving and analyzing large data sets. Fundamentally making data processing and transport faster and cheaper, Apache Arrow provides a powerful, flexible platform for working with big data across different systems and programming languages.<\/p>\n<p>Several of its key innovations enable analytical tooling to be more performant. These include:<\/p>\n<ul>\n<li style=\"text-align: justify;\"><strong>In-memory columnar format:<\/strong> Apache Arrow offers this standard format for how to represent data for analytics in memory. Having a standard format reduces the need to serialize and deserialize between proprietary formats. Arrow supports random access streaming and batch scan workloads.<\/li>\n<li style=\"text-align: justify;\"><strong>Language Libraries:<\/strong> Arrow\u2019s libraries for working with its standard format span more than 12 programming languages, making the benefits available in most languages that developers are using to build tooling. Languages include C, C++, C#, Java, Go, Rust, Python, Javascript, R, Julia, MATLAB, and Ruby.<\/li>\n<li style=\"text-align: justify;\"><strong>Apache Arrow Gandiva:<\/strong> This part of the Arrow project allows for the creation of SQL User Defined Functions (UDFs) for working with Arrow-based data, which can be pre-compiled to native code for improved performance.<\/li>\n<li style=\"text-align: justify;\"><strong>Apache Arrow Flight:<\/strong> Flight is a protocol for transporting Arrow Data over the wire, meant to substitute traditionally row-based options such as JDBC\/ODBC, which would reduce performance due to serialization and deserialization. With Arrow Flight, we can extend the performance benefits, not just in faster processing, but in faster transport. <a href=\"https:\/\/www.youtube.com\/watch?v=dQszohqgZbQ\" target=\"_blank\" rel=\"noopener\">This video<\/a> shows the performance difference between Apache Arrow and ODBC in Python. The transport protocol is implemented as a gRPC API, which means a client implemented in any language can speak to any source implementing the spec. One client is all you need.<\/li>\n<li style=\"text-align: justify;\"><strong>Apache Arrow ODBC\/JDBC Drivers:<\/strong> To make using Flight easier, Arrow now has ODBC\/JDBC drivers that allow developers to leverage the patterns they are used to, but with the speed benefits of Arrow. The Arrow driver can help connect to any Arrow supporting source, ending the days of having to download and configure many drivers.<\/li>\n<\/ul>\n<div class=\"widget\"><div class=\"aside-card\">\t\t\t<div class=\"textwidget\"><a class=\"speedbump\" href=\"https:\/\/solutionsreview.com\/data-management\/data-management-data-warehouse-buyers-guide\/\" title=\"Download link to Data Management Buyers Guide\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1682\" src=\"https:\/\/solutionsreview.com\/data-management\/files\/2019\/01\/data-management-speedbump-cta.jpg\" alt=\"Download Link to Data Management Buyers Guide\" width=\"800\" height=\"225\" \/><\/a><\/div>\n\t\t<\/div><\/div>\n<h2><strong>Apache Arrow Tutorial<\/strong><\/h2>\n<hr \/>\n<h3><strong>How to Begin Working with Arrow<\/strong><\/h3>\n<p>If you are doing analytics, you don\u2019t need to adopt Arrow directly but ensure the platforms, tools, and libraries you are working with take advantage of Arrow. Ask yourself:<\/p>\n<ul>\n<li>Does the query engine that I use to process data use Apache Arrow, and is there an Apache Arrow Flight endpoint for me to pull data from performantly?<\/li>\n<li>Do the libraries I use for analytics support work with Apache Arrow Buffers, Tables, RecordBatchReaders and other objects?<\/li>\n<\/ul>\n<p style=\"text-align: justify;\">Some of the options that answer these questions and offer <a href=\"https:\/\/arrow.apache.org\/powered_by\/\" target=\"_blank\" rel=\"noopener\">support for Arrow<\/a> include: Dremio, Datafusion, Acero, Apache Parquet, Bodo, Clickhouse, Apache Spark, Cylon, Graphique, Dask, InfluxDB, Pandas, Polars, Ray and DuckDB.<\/p>\n<p style=\"text-align: justify;\">To get the full benefit of Apache Arrow, you essentially want three things: (1) a query engine that uses Arrow and has an Arrow Flight endpoint; (2) use of Arrow Flight or the Arrow Flight ODBC\/JDBC drivers to connect via Arrow Flight; and (3) the ability to pull down the data you need, then continue running analytics with libraries that support Apache Arrow. If these things are true, the queries, transport and local ad hoc analysis will see huge improvement, which can also translate into cost savings through less compute costs.<\/p>\n<h3><strong>Tools Easily Work Together<\/strong><\/h3>\n<p>Many tools support Arrow and to demonstrate how easily they can work together, below is a Python code example of pulling down data from an Arrow Flight endpoint from Dremio, then taking the Arrow data and passing it to DuckDB for local management. Users can enjoy this kind of easy experience with an array of tools that support Arrow and Flight.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-5231 aligncenter\" src=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_jtCmjAEUEjtB.png\" alt=\"\" width=\"615\" height=\"511\" srcset=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_jtCmjAEUEjtB.png 615w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_jtCmjAEUEjtB-300x249.png 300w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_jtCmjAEUEjtB-361x300.png 361w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_jtCmjAEUEjtB-97x81.png 97w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_jtCmjAEUEjtB-217x180.png 217w\" sizes=\"(max-width: 615px) 100vw, 615px\" \/><\/p>\n<p style=\"text-align: justify;\">Here\u2019s another example. This one uses Arrow in Javascript <a href=\"https:\/\/arrow.apache.org\/docs\/java\/index.html\" target=\"_blank\" rel=\"noopener\">from the documentation<\/a>. Here, arrays are created to represent different columns of a table and then converted into an Arrow table.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-5232 aligncenter\" src=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_IEEY4z5eX8HS.jpg\" alt=\"\" width=\"625\" height=\"188\" srcset=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_IEEY4z5eX8HS.jpg 625w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_IEEY4z5eX8HS-300x90.jpg 300w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_IEEY4z5eX8HS-600x180.jpg 600w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_IEEY4z5eX8HS-180x54.jpg 180w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_IEEY4z5eX8HS-400x120.jpg 400w\" sizes=\"(max-width: 625px) 100vw, 625px\" \/><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-5233 aligncenter\" src=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_oFRRhDWJwmxu.png\" alt=\"\" width=\"623\" height=\"189\" srcset=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_oFRRhDWJwmxu.png 623w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_oFRRhDWJwmxu-300x91.png 300w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_oFRRhDWJwmxu-600x182.png 600w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_oFRRhDWJwmxu-180x55.png 180w, https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/oie_oFRRhDWJwmxu-400x121.png 400w\" sizes=\"(max-width: 623px) 100vw, 623px\" \/><\/p>\n<h3><strong>A Ubiquitous Piece of the Fabric<\/strong><\/h3>\n<p style=\"text-align: justify;\">Efficient data transfer, fast processing, cross-platform compatibility, memory optimization and standardization are among the key benefits Arrow delivers. Because data is stored in a columnar format, it can be accessed and processed more readily. Columnar data storage allows for efficient vectorized processing, which can significantly improve processing times, especially when working with large datasets. For memory optimization, Arrow uses a &#8220;zero copy&#8221; approach designed to minimize memory usage, which allows data to be passed between systems without copying it into new memory buffers.<\/p>\n<p style=\"text-align: justify;\">By providing a standard in-memory format that allows different software and tools to exchange data at little to no performance cost and by providing libraries for working with that format in most languages, Apache Arrow greatly reduces the costs of transporting data across the wire.<\/p>\n<p style=\"text-align: justify;\">Arrow\u2019s standardization simplifies data integration and makes it easier to build tools that work with data from different sources. Arrow is on a trajectory to become a ubiquitous piece of the fabric for all our favorite data tools.<\/p>\n<div class=\"hr hr\"><\/div>\n<div class=\"widget\"><div class=\"aside-card\">\t\t\t<div class=\"textwidget\"><p><a class=\"speedbump\" href=\"https:\/\/solutionsreview.com\/data-management\/data-management-vendor-map-a-guide-to-the-best-data-management-tools\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1682\" src=\"https:\/\/solutionsreview.com\/data-management\/files\/2019\/01\/data-management-vendor-map-sb-cta.jpg\" alt=\"Download Link to Data Management Vendor Map\" width=\"800\" height=\"225\" \/><\/a><\/p>\n<\/div>\n\t\t<\/div><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Solutions Review\u2019s Expert Insights Series is a collection of contributed articles written by industry experts in enterprise software categories. In this feature, Dremio Developer Advocate Alex Merced offers a brief Apache Arrow tutorial with advice for getting started. The open-source Apache Arrow project has been transforming the data landscape for the better since its release [&hellip;]<\/p>\n","protected":false},"author":583,"featured_media":5244,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[3],"tags":[1359,678],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v23.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>A Brief Apache Arrow Tutorial by Open Data Architecture Expert<\/title>\n<meta name=\"description\" content=\"In this feature, Dremio Developer Advocate Alex Merced offers a brief Apache Arrow tutorial with advice for getting started.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"A Brief Apache Arrow Tutorial by Open Data Architecture Expert\" \/>\n<meta property=\"og:description\" content=\"In this feature, Dremio Developer Advocate Alex Merced offers a brief Apache Arrow tutorial with advice for getting started.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/\" \/>\n<meta property=\"og:site_name\" content=\"Best Data Management Software, Vendors and Data Science Platforms\" \/>\n<meta property=\"article:published_time\" content=\"2023-03-30T18:40:09+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-03-31T11:17:33+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"800\" \/>\n\t<meta property=\"og:image:height\" content=\"400\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Alex Merced\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Alex Merced\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/\",\"url\":\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/\",\"name\":\"A Brief Apache Arrow Tutorial by Open Data Architecture Expert\",\"isPartOf\":{\"@id\":\"https:\/\/solutionsreview.com\/data-management\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg\",\"datePublished\":\"2023-03-30T18:40:09+00:00\",\"dateModified\":\"2023-03-31T11:17:33+00:00\",\"author\":{\"@id\":\"https:\/\/solutionsreview.com\/data-management\/#\/schema\/person\/9ea4d93b5de9e2e4843b9d9d29949e76\"},\"description\":\"In this feature, Dremio Developer Advocate Alex Merced offers a brief Apache Arrow tutorial with advice for getting started.\",\"breadcrumb\":{\"@id\":\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#primaryimage\",\"url\":\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg\",\"contentUrl\":\"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg\",\"width\":800,\"height\":400,\"caption\":\"Apache Arrow Tutorial\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/solutionsreview.com\/data-management\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"A Brief Apache Arrow Tutorial by Open Data Architecture Expert\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/solutionsreview.com\/data-management\/#website\",\"url\":\"https:\/\/solutionsreview.com\/data-management\/\",\"name\":\"Best Data Management Software, Vendors and Data Science Platforms\",\"description\":\"Enterprise Information Management\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/solutionsreview.com\/data-management\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/solutionsreview.com\/data-management\/#\/schema\/person\/9ea4d93b5de9e2e4843b9d9d29949e76\",\"name\":\"Alex Merced\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/solutionsreview.com\/data-management\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/solutionsreview.com\/data-management\/files\/2025\/12\/Alex-Merced.jpg\",\"contentUrl\":\"https:\/\/solutionsreview.com\/data-management\/files\/2025\/12\/Alex-Merced.jpg\",\"caption\":\"Alex Merced\"},\"description\":\"Alex Merced is a Developer Advocate for Dremio, focusing on educating the industry about using open data architecture with open source technologies such as Apache Iceberg, Apache Arrow, Project Nessie and more. Alex Merced is the host of several data podcasts such as Gnarly Data Waves, DataNation and co-host of \u201cSelect * from data.lake\u201d. Alex Merced has worked as a developer for companies Crossfield Digital, Gened Systems, CampusGuard and has trained engineers as an instructor at General Assembly.\",\"url\":\"https:\/\/solutionsreview.com\/data-management\/author\/amerced\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"A Brief Apache Arrow Tutorial by Open Data Architecture Expert","description":"In this feature, Dremio Developer Advocate Alex Merced offers a brief Apache Arrow tutorial with advice for getting started.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/","og_locale":"en_US","og_type":"article","og_title":"A Brief Apache Arrow Tutorial by Open Data Architecture Expert","og_description":"In this feature, Dremio Developer Advocate Alex Merced offers a brief Apache Arrow tutorial with advice for getting started.","og_url":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/","og_site_name":"Best Data Management Software, Vendors and Data Science Platforms","article_published_time":"2023-03-30T18:40:09+00:00","article_modified_time":"2023-03-31T11:17:33+00:00","og_image":[{"width":800,"height":400,"url":"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg","type":"image\/jpeg"}],"author":"Alex Merced","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Alex Merced","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/","url":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/","name":"A Brief Apache Arrow Tutorial by Open Data Architecture Expert","isPartOf":{"@id":"https:\/\/solutionsreview.com\/data-management\/#website"},"primaryImageOfPage":{"@id":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#primaryimage"},"image":{"@id":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#primaryimage"},"thumbnailUrl":"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg","datePublished":"2023-03-30T18:40:09+00:00","dateModified":"2023-03-31T11:17:33+00:00","author":{"@id":"https:\/\/solutionsreview.com\/data-management\/#\/schema\/person\/9ea4d93b5de9e2e4843b9d9d29949e76"},"description":"In this feature, Dremio Developer Advocate Alex Merced offers a brief Apache Arrow tutorial with advice for getting started.","breadcrumb":{"@id":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#primaryimage","url":"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg","contentUrl":"https:\/\/solutionsreview.com\/data-management\/files\/2023\/03\/Apache-Arrow-Tutorial-Dremio.jpg","width":800,"height":400,"caption":"Apache Arrow Tutorial"},{"@type":"BreadcrumbList","@id":"https:\/\/solutionsreview.com\/data-management\/apache-arrow-tutorial\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/solutionsreview.com\/data-management\/"},{"@type":"ListItem","position":2,"name":"A Brief Apache Arrow Tutorial by Open Data Architecture Expert"}]},{"@type":"WebSite","@id":"https:\/\/solutionsreview.com\/data-management\/#website","url":"https:\/\/solutionsreview.com\/data-management\/","name":"Best Data Management Software, Vendors and Data Science Platforms","description":"Enterprise Information Management","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/solutionsreview.com\/data-management\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/solutionsreview.com\/data-management\/#\/schema\/person\/9ea4d93b5de9e2e4843b9d9d29949e76","name":"Alex Merced","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/solutionsreview.com\/data-management\/#\/schema\/person\/image\/","url":"https:\/\/solutionsreview.com\/data-management\/files\/2025\/12\/Alex-Merced.jpg","contentUrl":"https:\/\/solutionsreview.com\/data-management\/files\/2025\/12\/Alex-Merced.jpg","caption":"Alex Merced"},"description":"Alex Merced is a Developer Advocate for Dremio, focusing on educating the industry about using open data architecture with open source technologies such as Apache Iceberg, Apache Arrow, Project Nessie and more. Alex Merced is the host of several data podcasts such as Gnarly Data Waves, DataNation and co-host of \u201cSelect * from data.lake\u201d. Alex Merced has worked as a developer for companies Crossfield Digital, Gened Systems, CampusGuard and has trained engineers as an instructor at General Assembly.","url":"https:\/\/solutionsreview.com\/data-management\/author\/amerced\/"}]}},"_links":{"self":[{"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/posts\/5229"}],"collection":[{"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/users\/583"}],"replies":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/comments?post=5229"}],"version-history":[{"count":0,"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/posts\/5229\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/media\/5244"}],"wp:attachment":[{"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/media?parent=5229"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/categories?post=5229"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/solutionsreview.com\/data-management\/wp-json\/wp\/v2\/tags?post=5229"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}