{"id":5192,"date":"2025-08-13T17:36:01","date_gmt":"2025-08-13T17:36:01","guid":{"rendered":"https:\/\/b2f.pt\/?p=5192"},"modified":"2025-08-25T08:58:42","modified_gmt":"2025-08-25T08:58:42","slug":"guia-de-otimizacao-spark-no-microsoft-fabric","status":"publish","type":"post","link":"https:\/\/b2f.pt\/en\/guia-de-otimizacao-spark-no-microsoft-fabric\/","title":{"rendered":"Spark Optimization Guide in Microsoft Fabric"},"content":{"rendered":"<p>In modern <strong>Business Intelligence<\/strong>, handling large volumes of data efficiently is a constant challenge. <strong>Microsoft Fabric<\/strong> is a unified platform that simplifies data architecture, and its integration with <strong>Apache Spark<\/strong> is at the heart of its large-scale data processing capability.<\/p>\n\n\n\n<p>In this post, we will explore how this powerful combination works. We will start by understanding the role of Spark in distributed processing and its main advantages, and then we will show how easy it is to set up and optimize a Spark cluster within the Fabric environment.<\/p>\n\n\n\n<div style=\"height:81px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">1. Overview of Microsoft Fabric<\/h2>\n\n\n\n<p>Microsoft Fabric consolidates functionalities such as Data Warehousing, Data Engineering, real-time processing, Data Science, and Machine Learning into a single platform, making integrated information management easier.<\/p>\n\n\n\n<p>In the realm of Data Engineering, the use of Spark is central. Spark is a distributed processing technology that executes tasks in parallel, optimizing performance in scenarios with large volumes of data. 
In Fabric, Spark comes pre-integrated\u2014no additional installation is required\u2014and clusters are automatically managed by the service, allowing dynamic scalability according to the workload.<\/p>\n\n\n<div class=\"wp-block-image is-style-rounded\">\n<figure class=\"aligncenter size-large is-resized\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1024\" height=\"428\" src=\"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image-1024x428.png\" alt=\"\" class=\"wp-image-5193\" style=\"width:581px;height:auto\" srcset=\"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image-1024x428.png 1024w, https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image-300x125.png 300w, https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image-768x321.png 768w, https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image-18x8.png 18w, https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image.png 1115w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure><\/div>\n\n\n<div style=\"height:103px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">2. How Does Spark Work?<\/h2>\n\n\n\n<p>Apache Spark is a distributed processing engine that operates through a master-worker architecture with inherent parallelism, allowing the processing of large volumes of data across multiple machines in a coordinated manner.<\/p>\n\n\n\n<div style=\"height:42px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">2.1. Parallelism Architecture in Spark<\/h3>\n\n\n\n<p>Spark operates on a hierarchical architecture composed of two main types of nodes: the&nbsp;<strong>master node<\/strong>&nbsp;(also referred to as&nbsp;<strong>driver<\/strong>) and the&nbsp;<strong>worker nodes<\/strong>&nbsp;(which execute&nbsp;<strong>executors<\/strong>). 
This distribution allows Spark to break down complex tasks into smaller operations that are executed in parallel.<\/p>\n\n\n\n<p>The&nbsp;<strong>driver node<\/strong>&nbsp;acts as the central coordinator of the cluster, being responsible for:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Analyzing, distributing, and scheduling tasks among the executors<\/li>\n\n\n\n<li>Maintaining the&nbsp;<strong>SparkContext<\/strong>, which represents the connection to the Spark cluster<\/li>\n\n\n\n<li>Monitoring execution progress and ensuring fault tolerance<\/li>\n<\/ul>\n\n\n\n<p>The&nbsp;<strong>worker nodes<\/strong>&nbsp;contain the&nbsp;<strong>executors<\/strong>, which are processes responsible for the actual execution of tasks. Each executor has two main responsibilities:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Executing the code assigned by the driver node<\/li>\n\n\n\n<li>Reporting the progress of computations back to the driver node<\/li>\n<\/ul>\n\n\n\n<div style=\"height:38px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">2.2. How Parallelism Works<\/h3>\n\n\n\n<p>Parallelism in Spark is achieved by dividing the data into&nbsp;<strong>partitions<\/strong>&nbsp;distributed across the different nodes of the cluster. Each partition is processed <strong>independently<\/strong> by a separate task, allowing simultaneous operations. For example, if a dataset is divided into multiple 128 MB partitions, different executors can process these partitions in parallel, maximizing the use of computational resources.<\/p>\n\n\n\n<p>Spark creates a&nbsp;<strong>DAG (Directed Acyclic Graph)<\/strong>&nbsp;to schedule tasks and orchestrate the worker nodes in the cluster. 
This mechanism makes it possible to optimize the sequence of operations and facilitates recovery in case of failures by recomputing only the affected operations from a previous state of the data.<\/p>\n\n\n\n<div style=\"height:85px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">3. Spark Configurations in Microsoft Fabric<\/h2>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">3.1. Prerequisites<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Access to the Microsoft Fabric portal with the necessary permissions (admin, contributor, or member)<\/li>\n\n\n\n<li>A previously purchased Fabric SKU or an active Fabric Trial<\/li>\n<\/ul>\n\n\n\n<div style=\"height:42px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">3.2. Configuration Steps<\/h3>\n\n\n\n<div style=\"height:29px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h4 class=\"wp-block-heading\">Pre-warmed Cluster Configuration<\/h4>\n\n\n\n<p>Microsoft Fabric offers&nbsp;<strong>Starter Pools<\/strong>&nbsp;that use&nbsp;<strong>pre-warmed<\/strong>&nbsp;clusters running on Microsoft virtual machines to <strong>significantly reduce<\/strong> startup times. These clusters are always active and ready for use, providing Spark session initialization typically within 5 to 10 seconds, without the need for manual configuration.<\/p>\n\n\n\n<p>Starter Pools use medium-sized nodes that scale dynamically based on the needs of Spark jobs. 
When there are no dependencies on custom libraries or custom Spark properties, sessions start almost instantly because the cluster is already running and requires no provisioning time.<\/p>\n\n\n\n<p>However, there are scenarios where the startup time may be longer:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Custom libraries<\/strong>: Adds 30 seconds to 5 minutes for session customization<\/li>\n\n\n\n<li><strong>High regional usage<\/strong>: When Starter Pools are saturated, it may take 2\u20135 minutes to create new clusters<\/li>\n\n\n\n<li><strong>Network options<\/strong>: Private Links or Managed VNets disable Starter Pools, forcing on-demand creation<\/li>\n<\/ul>\n\n\n\n<div style=\"height:47px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h4 class=\"wp-block-heading\">High Concurrency Mode Activation<\/h4>\n\n\n\n<p>It is recommended to enable the&nbsp;<strong>High Concurrency Mode<\/strong>&nbsp;to allow multiple notebooks to share the same Spark session, optimizing resource usage and drastically reducing startup times. 
In custom high-concurrency pools, users experience significantly faster session startup compared to standard Spark sessions.<\/p>\n\n\n\n<p>To enable High Concurrency Mode:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Access the&nbsp;<strong>Workspace Settings<\/strong><\/li>\n\n\n\n<li>Navigate to&nbsp;<strong>Data Engineering\/Science<\/strong>&nbsp;&gt;&nbsp;<strong>Spark Settings<\/strong>&nbsp;&gt;&nbsp;<strong>High Concurrency<\/strong><\/li>\n\n\n\n<li>Enable the option&nbsp;<em><strong>For notebooks<\/strong><\/em><\/li>\n<\/ol>\n\n\n\n<div style=\"height:38px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n<div class=\"wp-block-image is-style-default\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"635\" height=\"263\" src=\"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image-1.png\" alt=\"\" class=\"wp-image-5194\" srcset=\"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image-1.png 635w, https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image-1-300x124.png 300w, https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/image-1-18x7.png 18w\" sizes=\"(max-width: 635px) 100vw, 635px\" \/><\/figure><\/div>\n\n\n<div style=\"height:60px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h4 class=\"wp-block-heading\">Recommended Specifications for Pools<\/h4>\n\n\n\n<p>To illustrate the ideal configurations, let's consider an example with SKU&nbsp;<strong>F64<\/strong>:<\/p>\n\n\n\n<p><strong>Base Capacity:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>F64 = 64 Capacity Units = 128 Spark VCores<\/li>\n\n\n\n<li>With a burst factor of 3x: 128 \u00d7 3 = 384 maximum Spark VCores (the burst factor multiplies the available processing capacity to enhance performance)<\/li>\n<\/ul>\n\n\n\n<p><strong>Recommended Configuration for Custom Pool:<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th><strong>Parameter<\/strong><\/th><th><strong>Recommended
Value<\/strong><\/th><th><strong>Explanation<\/strong><\/th><\/tr><\/thead><tbody><tr><td><strong>Node Family<\/strong><\/td><td>Memory Optimized<\/td><td>Suitable for data processing workloads<\/td><\/tr><tr><td><strong>Node Size<\/strong><\/td><td>Medium (8 VCores)<\/td><td>Balance between performance and concurrency<\/td><\/tr><tr><td><strong>Autoscale<\/strong><\/td><td>Enabled (min: 2, max: 48)<\/td><td>48 nodes \u00d7 8 VCores = 384 VCores (maximum burst)<\/td><\/tr><tr><td><strong>Dynamic Allocation<\/strong><\/td><td>Enabled<\/td><td>Allows automatic adjustment of executors based on demand<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p><strong>Dynamic Allocation Configuration:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Min Executors<\/strong>: 2 (baseline for immediate availability)<\/li>\n\n\n\n<li><strong>Max Executors<\/strong>: 46 (reserving 2 nodes for driver and overhead)<\/li>\n\n\n\n<li><strong>Initial Executors<\/strong>: 4 (balance between startup time and resource waste)<\/li>\n<\/ul>\n\n\n\n<p><strong>Scaling Based on SKU<\/strong>:<\/p>\n\n\n\n<p>For different SKUs, the maximum configurations vary:<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th><strong>SKU<\/strong><\/th><th><strong>Capacity Units<\/strong><\/th><th><strong>Max Spark VCores (3x burst)<\/strong><\/th><th><strong>Recommended Node Size<\/strong><\/th><th><strong>Max Nodes<\/strong><\/th><\/tr><\/thead><tbody><tr><td>F2<\/td><td>2<\/td><td>12<\/td><td>Small<\/td><td>3<\/td><\/tr><tr><td>F8<\/td><td>8<\/td><td>48<\/td><td>Medium<\/td><td>6<\/td><\/tr><tr><td>F16<\/td><td>16<\/td><td>96<\/td><td>Medium<\/td><td>12<\/td><\/tr><tr><td>F64<\/td><td>64<\/td><td>384<\/td><td>Medium\/Large<\/td><td>48\/24<\/td><\/tr><tr><td>F128<\/td><td>128<\/td><td>768<\/td><td>Large<\/td><td>48<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<div style=\"height:60px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h4 
class=\"wp-block-heading\">Configuration Through the Portal<\/h4>\n\n\n\n<p><strong>In the Microsoft Fabric portal:<\/strong><\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Go to the&nbsp;<strong>Data Engineering<\/strong>&nbsp;&gt;&nbsp;<strong>Spark Settings<\/strong>&nbsp;section<\/li>\n\n\n\n<li>Select&nbsp;<strong>New Pool<\/strong>&nbsp;to create a custom cluster<\/li>\n\n\n\n<li>Set the&nbsp;<strong>Node Family<\/strong>&nbsp;and&nbsp;<strong>Node Size<\/strong>&nbsp;according to the requirements<\/li>\n\n\n\n<li>Configure&nbsp;<strong>Autoscale<\/strong>; the minimum number of nodes can be as low as 1 (Fabric ensures recoverable availability even with a single node)<\/li>\n\n\n\n<li>Enable&nbsp;<strong>Dynamic Executor Allocation<\/strong>&nbsp;for automatic resource optimization<\/li>\n<\/ol>\n\n\n\n<p><strong>Integration with Data Sources:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Use native connectors to establish connections to Data Lakes or Data Warehouses<\/li>\n\n\n\n<li>Ensure that credentials and security settings are correctly configured<\/li>\n<\/ul>\n\n\n\n<p><strong>Notebook and Task Configuration:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Configure notebooks (Python or Scala) for developing transformation scripts<\/li>\n\n\n\n<li>Schedule batch tasks or configure streaming processes according to requirements<\/li>\n<\/ul>\n\n\n\n<p>Custom pools have a default&nbsp;<strong>autopause<\/strong>&nbsp;of 2 minutes, after which sessions expire and clusters are deallocated, with charges applied only for active usage time.<\/p>\n\n\n\n<p>The ideal cluster settings vary depending on the processes to be executed, so each case should be evaluated and tested.<\/p>\n\n\n\n<div style=\"height:86px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">4. 
Best Practices and Technical Considerations<\/h2>\n\n\n\n<div style=\"height:39px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">Sizing and Optimization<\/h3>\n\n\n\n<p>Properly sizing the cluster is essential. Consider:<\/p>\n\n\n\n<p><strong>Burst Factor<\/strong>: Determine how much instant scalability is needed to handle processing peaks. The maximum pool size is the base number of Spark VCores multiplied by the burst factor. For example, for an F64 SKU (128 base VCores), configure the pool up to 384 VCores by adjusting the number of nodes and node size (e.g., Medium nodes (8 VCores each) \u00d7 48 nodes = 384 VCores).<\/p>\n\n\n\n<p><strong>Number of Cores and Memory<\/strong>: For the driver node, select an appropriate number of cores and memory, as it orchestrates the processing and must support cluster management tasks. For worker nodes, base the choice on the total resources (cores and memory per node multiplied by the number of nodes) needed for parallel processing. 
Consider the possibility of scaling these nodes to adjust performance according to the workload.<\/p>\n\n\n\n<div style=\"height:29px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">Automation and Scheduling<\/h3>\n\n\n\n<p>Automate recurring processes through scripts and scheduling, ensuring consistency and minimizing errors.<\/p>\n\n\n\n<div style=\"height:33px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">Monitoring<\/h3>\n\n\n\n<p>Use Fabric\u2019s native monitoring tools to identify potential issues and adjust cluster performance in real time.<\/p>\n\n\n\n<div style=\"height:28px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">Security<\/h3>\n\n\n\n<p>Ensure the implementation of robust security policies by configuring permissions and using secure connections for data access.<\/p>\n\n\n\n<div style=\"height:47px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">What are the next steps?<\/h2>\n\n\n\n<p>B2F has solid experience in developing solutions on Microsoft Fabric, as well as implementing Spark-based processes. If you need expert support to maximize the performance and efficiency of your analytics platform, our team is ready to collaborate with you and find the best solutions for your challenges.<\/p>","protected":false},"excerpt":{"rendered":"<p>No universo do Business Intelligence moderno, lidar com grandes volumes de dados de forma eficiente \u00e9 um desafio constante. O Microsoft Fabric surge como uma plataforma unificada que simplifica a arquitetura de dados, e a sua integra\u00e7\u00e3o com o Apache Spark \u00e9 o cora\u00e7\u00e3o da sua capacidade de processamento de dados em escala. 
Neste post, [&hellip;]<\/p>","protected":false},"author":1,"featured_media":5207,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[1],"tags":[42,45,43,79],"class_list":["post-5192","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-business-intelligence","tag-business-intelligence","tag-data-engineering","tag-microsoft-fabric","tag-spark"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.2 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\r\n<title>Guia de Otimiza\u00e7\u00e3o Spark no Microsoft Fabric - B2F - Business to Future<\/title>\r\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\r\n<link rel=\"canonical\" href=\"https:\/\/b2f.pt\/en\/guia-de-otimizacao-spark-no-microsoft-fabric\/\" \/>\r\n<meta property=\"og:locale\" content=\"en_GB\" \/>\r\n<meta property=\"og:type\" content=\"article\" \/>\r\n<meta property=\"og:title\" content=\"Guia de Otimiza\u00e7\u00e3o Spark no Microsoft Fabric - B2F - Business to Future\" \/>\r\n<meta property=\"og:description\" content=\"No universo do Business Intelligence moderno, lidar com grandes volumes de dados de forma eficiente \u00e9 um desafio constante. O Microsoft Fabric surge como uma plataforma unificada que simplifica a arquitetura de dados, e a sua integra\u00e7\u00e3o com o Apache Spark \u00e9 o cora\u00e7\u00e3o da sua capacidade de processamento de dados em escala. 
Neste post, [&hellip;]\" \/>\r\n<meta property=\"og:url\" content=\"https:\/\/b2f.pt\/en\/guia-de-otimizacao-spark-no-microsoft-fabric\/\" \/>\r\n<meta property=\"og:site_name\" content=\"B2F - Business to Future\" \/>\r\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/businesstofuture\/?locale=pt_PT\" \/>\r\n<meta property=\"article:published_time\" content=\"2025-08-13T17:36:01+00:00\" \/>\r\n<meta property=\"article:modified_time\" content=\"2025-08-25T08:58:42+00:00\" \/>\r\n<meta property=\"og:image\" content=\"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg\" \/>\r\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\r\n\t<meta property=\"og:image:height\" content=\"2398\" \/>\r\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\r\n<meta name=\"author\" content=\"b2f\" \/>\r\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\r\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"b2f\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"9 minutes\" \/>\r\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/\"},\"author\":{\"name\":\"b2f\",\"@id\":\"https:\/\/b2f.pt\/#\/schema\/person\/53a60449eab118b3e0c1818995da0497\"},\"headline\":\"Guia de Otimiza\u00e7\u00e3o Spark no Microsoft 
Fabric\",\"datePublished\":\"2025-08-13T17:36:01+00:00\",\"dateModified\":\"2025-08-25T08:58:42+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/\"},\"wordCount\":1546,\"publisher\":{\"@id\":\"https:\/\/b2f.pt\/#organization\"},\"image\":{\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg\",\"keywords\":[\"Business Intelligence\",\"Data Engineering\",\"Microsoft Fabric\",\"Spark\"],\"articleSection\":[\"Business Intelligence\"],\"inLanguage\":\"en-GB\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/\",\"url\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/\",\"name\":\"Guia de Otimiza\u00e7\u00e3o Spark no Microsoft Fabric - B2F - Business to Future\",\"isPartOf\":{\"@id\":\"https:\/\/b2f.pt\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg\",\"datePublished\":\"2025-08-13T17:36:01+00:00\",\"dateModified\":\"2025-08-25T08:58:42+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#primaryimage\",\"url\":\"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg\",\"contentUrl\":\"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg\",\"width\":2560,\"heigh
t\":2398},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/b2f.pt\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Guia de Otimiza\u00e7\u00e3o Spark no Microsoft Fabric\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/b2f.pt\/#website\",\"url\":\"https:\/\/b2f.pt\/\",\"name\":\"B2F - Business to Future\",\"description\":\"Retire Conhecimento Dos Seus Dados\",\"publisher\":{\"@id\":\"https:\/\/b2f.pt\/#organization\"},\"alternateName\":\"B2F\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/b2f.pt\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/b2f.pt\/#organization\",\"name\":\"B2F - Business to Future\",\"alternateName\":\"B2F\",\"url\":\"https:\/\/b2f.pt\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/b2f.pt\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/b2f.pt\/wp-content\/uploads\/2024\/01\/logo.svg\",\"contentUrl\":\"https:\/\/b2f.pt\/wp-content\/uploads\/2024\/01\/logo.svg\",\"width\":2908,\"height\":664,\"caption\":\"B2F - Business to 
Future\"},\"image\":{\"@id\":\"https:\/\/b2f.pt\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/businesstofuture\/?locale=pt_PT\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/b2f.pt\/#\/schema\/person\/53a60449eab118b3e0c1818995da0497\",\"name\":\"b2f\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/b2f.pt\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/b44a9c116c175f8fb789cb42f4dcedc8051cd52bd25399a21d294fe1f984bbc9?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/b44a9c116c175f8fb789cb42f4dcedc8051cd52bd25399a21d294fe1f984bbc9?s=96&d=mm&r=g\",\"caption\":\"b2f\"},\"sameAs\":[\"https:\/\/b2f.pt\"],\"url\":\"https:\/\/b2f.pt\/en\/author\/b2f\/\"}]}<\/script>\r\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Guia de Otimiza\u00e7\u00e3o Spark no Microsoft Fabric - B2F - Business to Future","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/b2f.pt\/en\/guia-de-otimizacao-spark-no-microsoft-fabric\/","og_locale":"en_GB","og_type":"article","og_title":"Guia de Otimiza\u00e7\u00e3o Spark no Microsoft Fabric - B2F - Business to Future","og_description":"No universo do Business Intelligence moderno, lidar com grandes volumes de dados de forma eficiente \u00e9 um desafio constante. O Microsoft Fabric surge como uma plataforma unificada que simplifica a arquitetura de dados, e a sua integra\u00e7\u00e3o com o Apache Spark \u00e9 o cora\u00e7\u00e3o da sua capacidade de processamento de dados em escala. 
Neste post, [&hellip;]","og_url":"https:\/\/b2f.pt\/en\/guia-de-otimizacao-spark-no-microsoft-fabric\/","og_site_name":"B2F - Business to Future","article_publisher":"https:\/\/www.facebook.com\/businesstofuture\/?locale=pt_PT","article_published_time":"2025-08-13T17:36:01+00:00","article_modified_time":"2025-08-25T08:58:42+00:00","og_image":[{"width":2560,"height":2398,"url":"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg","type":"image\/jpeg"}],"author":"b2f","twitter_card":"summary_large_image","twitter_misc":{"Written by":"b2f","Estimated reading time":"9 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#article","isPartOf":{"@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/"},"author":{"name":"b2f","@id":"https:\/\/b2f.pt\/#\/schema\/person\/53a60449eab118b3e0c1818995da0497"},"headline":"Guia de Otimiza\u00e7\u00e3o Spark no Microsoft Fabric","datePublished":"2025-08-13T17:36:01+00:00","dateModified":"2025-08-25T08:58:42+00:00","mainEntityOfPage":{"@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/"},"wordCount":1546,"publisher":{"@id":"https:\/\/b2f.pt\/#organization"},"image":{"@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#primaryimage"},"thumbnailUrl":"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg","keywords":["Business Intelligence","Data Engineering","Microsoft Fabric","Spark"],"articleSection":["Business Intelligence"],"inLanguage":"en-GB"},{"@type":"WebPage","@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/","url":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/","name":"Guia de Otimiza\u00e7\u00e3o Spark no Microsoft Fabric - B2F - Business to 
Future","isPartOf":{"@id":"https:\/\/b2f.pt\/#website"},"primaryImageOfPage":{"@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#primaryimage"},"image":{"@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#primaryimage"},"thumbnailUrl":"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg","datePublished":"2025-08-13T17:36:01+00:00","dateModified":"2025-08-25T08:58:42+00:00","breadcrumb":{"@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#primaryimage","url":"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg","contentUrl":"https:\/\/b2f.pt\/wp-content\/uploads\/2025\/08\/imagem_comprimida-scaled.jpg","width":2560,"height":2398},{"@type":"BreadcrumbList","@id":"https:\/\/b2f.pt\/guia-de-otimizacao-spark-no-microsoft-fabric\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/b2f.pt\/"},{"@type":"ListItem","position":2,"name":"Guia de Otimiza\u00e7\u00e3o Spark no Microsoft Fabric"}]},{"@type":"WebSite","@id":"https:\/\/b2f.pt\/#website","url":"https:\/\/b2f.pt\/","name":"B2F - Business to Future","description":"Retire Conhecimento Dos Seus Dados","publisher":{"@id":"https:\/\/b2f.pt\/#organization"},"alternateName":"B2F","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/b2f.pt\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Organization","@id":"https:\/\/b2f.pt\/#organization","name":"B2F - Business to 
Future","alternateName":"B2F","url":"https:\/\/b2f.pt\/","logo":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/b2f.pt\/#\/schema\/logo\/image\/","url":"https:\/\/b2f.pt\/wp-content\/uploads\/2024\/01\/logo.svg","contentUrl":"https:\/\/b2f.pt\/wp-content\/uploads\/2024\/01\/logo.svg","width":2908,"height":664,"caption":"B2F - Business to Future"},"image":{"@id":"https:\/\/b2f.pt\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/businesstofuture\/?locale=pt_PT"]},{"@type":"Person","@id":"https:\/\/b2f.pt\/#\/schema\/person\/53a60449eab118b3e0c1818995da0497","name":"b2f","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/b2f.pt\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/b44a9c116c175f8fb789cb42f4dcedc8051cd52bd25399a21d294fe1f984bbc9?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/b44a9c116c175f8fb789cb42f4dcedc8051cd52bd25399a21d294fe1f984bbc9?s=96&d=mm&r=g","caption":"b2f"},"sameAs":["https:\/\/b2f.pt"],"url":"https:\/\/b2f.pt\/en\/author\/b2f\/"}]}},"_links":{"self":[{"href":"https:\/\/b2f.pt\/en\/wp-json\/wp\/v2\/posts\/5192","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/b2f.pt\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/b2f.pt\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/b2f.pt\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/b2f.pt\/en\/wp-json\/wp\/v2\/comments?post=5192"}],"version-history":[{"count":0,"href":"https:\/\/b2f.pt\/en\/wp-json\/wp\/v2\/posts\/5192\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/b2f.pt\/en\/wp-json\/wp\/v2\/media\/5207"}],"wp:attachment":[{"href":"https:\/\/b2f.pt\/en\/wp-json\/wp\/v2\/media?parent=5192"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/b2f.pt\/en\/wp-json\/wp\/v2\/categories?post=5192"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/b2f.pt\/en\/wp-json\/
wp\/v2\/tags?post=5192"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}