{"id":2474,"date":"2019-12-25T11:39:08","date_gmt":"2019-12-25T10:39:08","guid":{"rendered":"https:\/\/myoceane.fr\/?p=2474"},"modified":"2019-12-30T12:18:43","modified_gmt":"2019-12-30T11:18:43","slug":"bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join","status":"publish","type":"post","link":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/","title":{"rendered":"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join"},"content":{"rendered":"<div id=\"fb-root\"><\/div>\n\n<p style=\"text-align: justify;\">Join \u662f\u4e00\u500b\u5728\u95dc\u806f\u6027\u8cc7\u6599\u5eab\u88e1\u9762\u5f88\u5e38\u4f7f\u7528\u7684\u4e00\u500b\u904b\u7b97\u5143\uff0c\u5728\u5927\u6578\u64da\u8cc7\u6599\u5eab\u6162\u6162\u666e\u53ca\u7684\u4eca\u5929\uff0cJoin \u9084\u662f\u4e00\u500b\u5e6b\u52a9\u6211\u5011\u4e86\u89e3\u8cc7\u6599\u95dc\u4fc2\u4e0d\u53ef\u6216\u7f3a\u7684\u89d2\u8272\uff0c\u4eca\u5929\u60f3\u8981\u8a0e\u8ad6\u7684\u662f\u5728 Spark \u88e1\u9762 Join \u80cc\u5f8c\u57f7\u884c\u7684\u904b\u7b97\u539f\u7406\uff0c\u7b46\u8005\u5728\u57f7\u884c Spark \u5de5\u4f5c\u7684\u6642\u5019\uff0c\u6709\u6642\u5019\u9700\u8981\u512a\u5316\u8cc7\u6599\u7684\u904b\u7b97\u904e\u7a0b\u4ee5\u964d\u4f4e\u904b\u7b97\u6240\u9700\u8981\u7684\u6642\u9593\uff0c\u672c\u7bc7\u7684\u8cc7\u6599\u4f86\u6e90\u53ef\u4ee5\u53c3\u8003<a href=\"https:\/\/jaceklaskowski.gitbooks.io\/mastering-spark-sql\/spark-sql-joins.html\">\u9023\u7d50<\/a>\uff0c\u53e6\u5916\u7b46\u8005\u4e5f\u5f88\u5efa\u8b70\u5927\u5bb6\u89c0\u770b\u4ee5\u4e0b\u9019\u4e00\u500b Youtube \u5f71\u7247\u3002<\/p>\n\n\n\n<figure class=\"wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"Optimizing Apache Spark SQL Joins: Spark Summit East talk by Vida Ha\" width=\"910\" height=\"512\" src=\"https:\/\/www.youtube.com\/embed\/fp53QhSfQcI?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<p>\u6211\u5011\u5229\u7528 Python(\u95dc\u65bc\u4f7f\u7528 Python \u53ef\u4ee5<a href=\"https:\/\/myoceane.fr\/index.php\/%e5%9c%a8-intellij-%e4%b8%ad%e9%96%8b%e7%99%bc-python-%e7%a8%8b%e5%bc%8f\/\">\u53c3\u8003\u9019\u4e00\u7bc7<\/a>)\u8209\u4e00\u500b\u7c21\u55ae\u7684\u4f8b\u5b50\u5c55\u793a\u5982\u4f55\u53ef\u4ee5\u77e5\u9053\u80cc\u666f\u7684 Join \u4f7f\u7528\u7684\u662f\u54ea\u4e00\u500b\uff0c\u5047\u8a2d\u6211\u5011\u5206\u5225\u6709\u57ce\u5e02\u8207\u570b\u5bb6\u7684\u6a94\u6848\uff0c\u4ed6\u5011\u7684\u5167\u5bb9\u5982\u4e0b\u9762\u5c55\u793a\uff1a<\/p>\n<pre class=\"lang:python\">city = sqlContext.read.option(\"multiline\", \"true\").json('city.json')\ncountry = sqlContext.read.option(\"multiline\", \"true\").json('country.json')\ncity.show()\ncountry.show()\ncity.join(country, city.country == country.countryId).show()<\/pre>\n<p>\u8f38\u51fa\u5f97\u5230\uff1a<\/p>\n<pre class=\"lang:bash\">+---------+------+-------+-------------+\n|     city|cityId|country|population(M)|\n+---------+------+-------+-------------+\n|Kaohsiung|     1|      1|        280.0|\n|   Taipei|     2|      1|        269.0|\n|   France|     3|      2|        214.0|\n|     Lyon|     4|      2|         48.4|\n+---------+------+-------+-------------+\n\n+---------+-------+---------+\n|continent|country|countryId|\n+---------+-------+---------+\n|        2| Taiwan|        1|\n|        1| France|        2|\n+---------+-------+---------+\n\n+---------+------+-------+-------------+---------+-------+---------+\n|     city|cityId|country|population(M)|continent|country|countryId|\n+---------+------+-------+-------------+---------+-------+---------+\n|Kaohsiung|     1|      1|        280.0|        2| Taiwan|        1|\n|   Taipei|     2|      1|        269.0|        2| Taiwan|        1|\n|   France|     3|      2|        214.0|        1| France|        2|\n|     Lyon|     4|      2|         48.4|        1| France|        2|\n+---------+------+-------+-------------+---------+-------+---------+<\/pre>\n\n\n\n<h5>\u4f7f\u7528 explain \u63a2\u67e5 Join \u578b\u614b\uff1a<\/h5>\n<p>\u60f3\u8981\u77e5\u9053\u7a76\u7adf Join \u4f7f\u7528\u7684\u662f\u54ea\u4e00\u7a2e Join \u7684\u80cc\u5f8c\u6a5f\u5236\uff0c\u53ef\u4ee5\u4f7f\u7528\u4e0b\u9762\u7684\u51fd\u5f0f\b .explain()\u3002<\/p>\n<pre class=\"lang:python\">city.join(country, city.country == country.countryId).explain()<\/pre>\n<p>\b\u5f97\u5230\u4ee5\u4e0b\u8f38\u51fa\u8cc7\u8a0a\uff0c\u6211\u5011\u77e5\u9053\u9019\u662f\u4f7f\u7528\u7684\u662f BroadcastHashJoin\uff0c\u4e3b\u8981\u7684\u904b\u4f5c\u908f\u8f2f\u662f\u5c07 country \u7684\u8cc7\u6599 broadcast \u53bb\u8ddf city \u505a join\u3002<\/p>\n<pre class=\"lang:bash\">== Physical Plan ==\n*(2) BroadcastHashJoin [country$2L], [countryId$27L], Inner, BuildRight\n:- *(2) Project [city$0, cityId$1L, country$2L, population(M)$3]\n:  +- *(2) Filter isnotnull(country$2L)\n:     +- *(2) FileScan json [city$0,cityId$1L,country$2L,population(M)$3] Batched: false, Format: JSON, Location: InMemoryFileIndex[file:\/...\/city.json], PartitionFilters: [], PushedFilters: [IsNotNull(country)], ReadSchema: struct&lt;city:string,cityId:bigint,country:bigint,population(M):double&gt;\n+- BroadcastExchange HashedRelationBroadcastMode(List(input[2, bigint, true]))\n   +- *(1) Project [continent$25L, country$26, countryId$27L]\n      +- *(1) Filter isnotnull(countryId$27L)\n         +- *(1) FileScan json [continent$25L,country$26,countryId$27L] Batched: false, Format: JSON, Location: InMemoryFileIndex[file:\/...\/country.json], PartitionFilters: [], PushedFilters: [IsNotNull(countryId)], ReadSchema: struct&lt;continent:bigint,country:string,countryId:bigint&gt;<\/pre>\n<p>&nbsp;<\/p>\n\n\n\n<h5>Youtube \u5f71\u7247\u4ecb\u7d39\u7684 Join \u65b9\u6cd5\uff1a<\/h5>\n<table style=\"border-collapse: collapse; width: 100%; height: 44px;\">\n<tbody>\n<tr style=\"height: 22px;\">\n<td style=\"width: 50%; height: 22px;\"><strong>Basic Joins:<\/strong><\/td>\n<td style=\"width: 50%; height: 22px;\"><strong>Special Cases:<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 22px;\">\n<td style=\"width: 50%; height: 22px;\">\n<ul>\n<li>Shuffle Hash Join<\/li>\n<li>Broadcast Hash Join<\/li>\n<li>Cartesian Join<\/li>\n<\/ul>\n<\/td>\n<td style=\"width: 50%; height: 22px;\">\n<ul>\n<li>Theta Join<\/li>\n<li>One-to-Many Join<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n\n\n\n<p>\u6700\u5e38\u7528\u7684\u5169\u7a2e Join \u65b9\u5f0f\u662f Shuffle Hash Join \u8ddf Broadcast Hash Join\uff0c\u4ee5\u4e0b\u7684\u5167\u5bb9\u4e3b\u8981\u64f7\u53d6\u81ea Youtube \u7684\u5167\u5bb9\uff1a<\/p>\n\n\n\n<h5>Shuffle Hash Join:<\/h5>\n<p>\u5de5\u4f5c\u539f\u7406\u662f\u50b3\u7d71\u7684 Map Reduce \u65b9\u6cd5\uff0c\u9996\u5148\u5148\u6839\u64da join on \u7684 key \u53bb\u628a\u5169\u500b\u8868\u683c\u7684\u5167\u5bb9\u4e1f\u5230\u4e0d\u540c\u7684 Worker Node \u4e0a\u9762\u5982\u4e0b\u5716(\u64f7\u53d6\u81ea Youtube \u5f71\u7247)\u6240\u793a\uff0c\u5982\u6b64\u4e00\u4f86 join \u7684\u5de5\u4f5c\u5c31\u53ef\u4ee5\u6839\u64da join key \u7684\u6578\u91cf\u9032\u884c\u5e73\u884c\u904b\u7b97\uff01<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"576\" src=\"https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.43.54-1024x576.png\" alt=\"\" class=\"wp-image-2600\" srcset=\"https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.43.54-1024x576.png 1024w, https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.43.54-300x169.png 300w, https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.43.54-768x432.png 768w, https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.43.54-1360x765.png 1360w, https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.43.54.png 1816w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>Shuffle Hash Join \u7684\u6548\u80fd\u4e3b\u8981\u53d7\u5236\u65bc\u4ee5\u4e0b\u5169\u500b\u8981\u7d20\uff0c&nbsp;<\/p>\n<ol>\n<li>\u5982\u679c\u6a94\u6848\u6c92\u6709\u5e73\u5747\u5206\u914d\u5230\u4e0d\u540c\u7684 key \u4e0a\u9762 -&gt; Uneven Sharding<\/li>\n<li>\u662f\u5426\u64c1\u6709\u9069\u7576\u7684 key \u7684\u6578\u91cf\u4f86\u9032\u884c\u5e73\u884c\u5316 -&gt; Limited Parallelism<\/li>\n<\/ol>\n\n\n\n<h5>Broadcast Hash Join<\/h5>\n<p style=\"text-align: justify;\">\u5982\u540c\u4e0a\u9762 explain \u7684\u4f8b\u5b50\uff0c\u7576 Broadcast Hash Join \u5728\u57f7\u884c\u7684\u6642\u5019\uff0c\u4ed6\u6bd4\u8f03\u9069\u5408\u57f7\u884c\u5728\u5176\u4e2d\u4e00\u500b DataFrame \u6c92\u6709\u90a3\u9ebc\u5927\u7684\u6642\u5019\uff0c\u4ed6\u7684\u904b\u4f5c\u908f\u8f2f\u662f\u5c07 Small DataFrame \u5ee3\u64ad\u9053\u4e0d\u540c\u7684 Partition \u88e1\u9762\uff0c\u6240\u4ee5\u4ed6\u7684\u5e73\u884c\u5316\u6578\u91cf\u662f Large DataFrame \u7684 Partition \u6578\u91cf\u3002<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"366\" src=\"https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.56.57-1024x366.png\" alt=\"\" class=\"wp-image-2610\" srcset=\"https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.56.57-1024x366.png 1024w, https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.56.57-300x107.png 300w, https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.56.57-768x274.png 768w, https:\/\/myoceane.fr\/wp-content\/uploads\/2019\/12\/\u87a2\u5e55\u5feb\u7167-2019-12-30-\u4e0a\u534811.56.57.png 1266w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h5>Cartesian Join (\u7b1b\u5361\u5152\u76f4\u7a4d)<\/h5>\n<p>\u5047\u8a2d\u8655\u7406\u7684\u5169\u500b\u8868\u683c\u5206\u5225\u662f 10000 \u884c\u8207 1000 \u884c\uff0c\u4f7f\u7528 Cartesian Join \u6703\u7522\u751f 10000 x 1000 \u884c\u7684 row \uff0c\u4e3b\u8981\u61c9\u8a72\u662f\u8981\u7528\u4f86\u8655\u7406\u4e00\u4e9b UDF \u7684\u8a08\u7b97\uff0c\u4f8b\u5982 Linear Regression \u6216\u662f avg, max \u7b49\u7b49\u7684\u8a08\u7b97\uff01<\/p>\n\n\n\n<h5>\u5176\u4ed6\u7684 Join\u00a0<\/h5>\n<p>\u6700\u5f8c\u5176\u5be6\u9084\u6709\u5f88\u591a Join \u662f\u6703\u88ab\u547c\u53eb\u7684\uff0c\u4f8b\u5982\u6211\u5011\u6539\u7de8\u4e00\u4e0b\u4e4b\u524d\u7684\u7684\u7a0b\u5f0f\u78bc\u5982\uff1a<\/p>\n<pre class=\"lang:python\">city.join(country, city.country &gt; country.countryId).explain()<\/pre>\n<p>\u7d50\u679c\u6703\u5f97\u5230 BroadcastNestedLoopJoin<\/p>\n<pre class=\"lang:bash\">== Physical Plan ==\nBroadcastNestedLoopJoin BuildRight, Inner, (country$2L &gt; countryId$27L)\n:- *(1) Project [city$0, cityId$1L, country$2L, population(M)$3]\n:  +- *(1) Filter isnotnull(country$2L)\n:     +- *(1) FileScan json [city$0,cityId$1L,country$2L,population(M)$3] Batched: false, Format: JSON, Location: InMemoryFileIndex[file:\/...\/city.json], PartitionFilters: [], PushedFilters: [IsNotNull(country)], ReadSchema: struct&lt;city:string,cityId:bigint,country:bigint,population(M):double&gt;\n+- BroadcastExchange IdentityBroadcastMode\n   +- *(2) Project [continent$25L, country$26, countryId$27L]\n      +- *(2) Filter isnotnull(countryId$27L)\n         +- *(2) FileScan json [continent$25L,country$26,countryId$27L] Batched: false, Format: JSON, Location: InMemoryFileIndex[file:\/...\/country.json], PartitionFilters: [], PushedFilters: [IsNotNull(countryId)], ReadSchema: struct&lt;continent:bigint,country:string,countryId:bigint&gt;<\/pre>\n<p>\u00a0<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Join \u662f\u4e00\u500b\u5728\u95dc\u806f\u6027\u8cc7\u6599\u5eab\u88e1\u9762\u5f88\u5e38\u4f7f\u7528\u7684\u4e00\u500b\u904b\u7b97\u5143\uff0c\u5728\u5927\u6578\u64da\u8cc7\u6599\u5eab\u6162\u6162\u666e\u53ca\u7684\u4eca\u5929\uff0cJoin \u9084\u662f\u4e00\u500b\u5e6b\u52a9\u6211\u5011\u4e86\u89e3\u8cc7\u6599\u95dc\u4fc2\u4e0d\u53ef\u6216\u7f3a\u7684\u89d2\u8272\uff0c\u4eca\u5929\u60f3\u8981\u8a0e\u8ad6\u7684\u662f\u5728 Spark \u88e1\u9762 Join \u80cc\u5f8c\u57f7\u884c\u7684\u904b\u7b97\u539f\u7406\uff0c\u7b46\u8005\u5728\u57f7\u884c Spark \u5de5\u4f5c\u7684\u6642\u5019\uff0c\u6709\u6642\u5019\u9700\u8981\u512a\u5316\u8cc7\u6599\u7684\u904b\u7b97\u904e\u7a0b\u4ee5\u964d\u4f4e\u904b\u7b97\u6240\u9700\u8981\u7684\u6642\u9593\uff0c\u672c\u7bc7\u7684\u8cc7\u6599\u4f86\u6e90\u53ef\u4ee5\u53c3\u8003\u9023\u7d50\uff0c\u53e6\u5916\u7b46\u8005\u4e5f\u5f88\u5efa\u8b70\u5927\u5bb6\u89c0\u770b\u4ee5\u4e0b\u9019\u4e00\u500b Youtube \u5f71\u7247\u3002<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9,14],"tags":[],"class_list":["post-2474","post","type-post","status-publish","format-standard","hentry","category-bigdata-ml","category-it-technology"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v24.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane<\/title>\n<meta name=\"description\" content=\"Join \u662f\u4e00\u500b\u5728\u95dc\u806f\u6027\u8cc7\u6599\u5eab\u88e1\u9762\u5f88\u5e38\u4f7f\u7528\u7684\u4e00\u500b\u904b\u7b97\u5143\uff0c\u5728\u5927\u6578\u64da\u8cc7\u6599\u5eab\u6162\u6162\u666e\u53ca\u7684\u4eca\u5929\uff0cJoin \u9084\u662f\u4e00\u500b\u5e6b\u52a9\u6211\u5011\u4e86\u89e3\u8cc7\u6599\u95dc\u4fc2\u4e0d\u53ef\u6216\u7f3a\u7684\u89d2\u8272\uff0c\u4eca\u5929\u60f3\u8981\u8a0e\u8ad6\u7684\u662f\u5728 Spark \u88e1\u9762 Join \u80cc\u5f8c\u57f7\u884c\u7684\u904b\u7b97\u539f\u7406\uff0c\u7b46\u8005\u5728\u57f7\u884c Spark \u5de5\u4f5c\u7684\u6642\u5019\uff0c\u6709\u6642\u5019\u9700\u8981\u512a\u5316\u8cc7\u6599\u7684\u904b\u7b97\u904e\u7a0b\u4ee5\u964d\u4f4e\u904b\u7b97\u6240\u9700\u8981\u7684\u6642\u9593\uff0c\u672c\u7bc7\u7684\u8cc7\u6599\u4f86\u6e90\u53ef\u4ee5\u53c3\u8003\u9023\u7d50\uff0c\u53e6\u5916\u7b46\u8005\u4e5f\u5f88\u5efa\u8b70\u5927\u5bb6\u89c0\u770b\u4ee5\u4e0b\u9019\u4e00\u500b Youtube \u5f71\u7247\u3002\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/myoceane.fr\/index.php\/bigdata-\u5927\u6578\u64da\u4e2d\u7684-join\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane\" \/>\n<meta property=\"og:description\" content=\"Join \u662f\u4e00\u500b\u5728\u95dc\u806f\u6027\u8cc7\u6599\u5eab\u88e1\u9762\u5f88\u5e38\u4f7f\u7528\u7684\u4e00\u500b\u904b\u7b97\u5143\uff0c\u5728\u5927\u6578\u64da\u8cc7\u6599\u5eab\u6162\u6162\u666e\u53ca\u7684\u4eca\u5929\uff0cJoin \u9084\u662f\u4e00\u500b\u5e6b\u52a9\u6211\u5011\u4e86\u89e3\u8cc7\u6599\u95dc\u4fc2\u4e0d\u53ef\u6216\u7f3a\u7684\u89d2\u8272\uff0c\u4eca\u5929\u60f3\u8981\u8a0e\u8ad6\u7684\u662f\u5728 Spark \u88e1\u9762 Join \u80cc\u5f8c\u57f7\u884c\u7684\u904b\u7b97\u539f\u7406\uff0c\u7b46\u8005\u5728\u57f7\u884c Spark \u5de5\u4f5c\u7684\u6642\u5019\uff0c\u6709\u6642\u5019\u9700\u8981\u512a\u5316\u8cc7\u6599\u7684\u904b\u7b97\u904e\u7a0b\u4ee5\u964d\u4f4e\u904b\u7b97\u6240\u9700\u8981\u7684\u6642\u9593\uff0c\u672c\u7bc7\u7684\u8cc7\u6599\u4f86\u6e90\u53ef\u4ee5\u53c3\u8003\u9023\u7d50\uff0c\u53e6\u5916\u7b46\u8005\u4e5f\u5f88\u5efa\u8b70\u5927\u5bb6\u89c0\u770b\u4ee5\u4e0b\u9019\u4e00\u500b Youtube \u5f71\u7247\u3002\" \/>\n<meta property=\"og:url\" content=\"https:\/\/myoceane.fr\/index.php\/bigdata-\u5927\u6578\u64da\u4e2d\u7684-join\/\" \/>\n<meta property=\"og:site_name\" content=\"\u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane\" \/>\n<meta property=\"article:published_time\" content=\"2019-12-25T10:39:08+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2019-12-30T11:18:43+00:00\" \/>\n<meta name=\"author\" content=\"\u6ab8\u6aac\u7238\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"\u6ab8\u6aac\u7238\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/\"},\"author\":{\"name\":\"\u6ab8\u6aac\u7238\",\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b\"},\"headline\":\"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join\",\"datePublished\":\"2019-12-25T10:39:08+00:00\",\"dateModified\":\"2019-12-30T11:18:43+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/\"},\"wordCount\":112,\"commentCount\":1,\"publisher\":{\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b\"},\"articleSection\":[\"Big Data &amp; Machine Learning\",\"IT Technology\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/\",\"url\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/\",\"name\":\"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane\",\"isPartOf\":{\"@id\":\"https:\/\/myoceane.fr\/#website\"},\"datePublished\":\"2019-12-25T10:39:08+00:00\",\"dateModified\":\"2019-12-30T11:18:43+00:00\",\"description\":\"Join \u662f\u4e00\u500b\u5728\u95dc\u806f\u6027\u8cc7\u6599\u5eab\u88e1\u9762\u5f88\u5e38\u4f7f\u7528\u7684\u4e00\u500b\u904b\u7b97\u5143\uff0c\u5728\u5927\u6578\u64da\u8cc7\u6599\u5eab\u6162\u6162\u666e\u53ca\u7684\u4eca\u5929\uff0cJoin \u9084\u662f\u4e00\u500b\u5e6b\u52a9\u6211\u5011\u4e86\u89e3\u8cc7\u6599\u95dc\u4fc2\u4e0d\u53ef\u6216\u7f3a\u7684\u89d2\u8272\uff0c\u4eca\u5929\u60f3\u8981\u8a0e\u8ad6\u7684\u662f\u5728 Spark \u88e1\u9762 Join \u80cc\u5f8c\u57f7\u884c\u7684\u904b\u7b97\u539f\u7406\uff0c\u7b46\u8005\u5728\u57f7\u884c Spark \u5de5\u4f5c\u7684\u6642\u5019\uff0c\u6709\u6642\u5019\u9700\u8981\u512a\u5316\u8cc7\u6599\u7684\u904b\u7b97\u904e\u7a0b\u4ee5\u964d\u4f4e\u904b\u7b97\u6240\u9700\u8981\u7684\u6642\u9593\uff0c\u672c\u7bc7\u7684\u8cc7\u6599\u4f86\u6e90\u53ef\u4ee5\u53c3\u8003\u9023\u7d50\uff0c\u53e6\u5916\u7b46\u8005\u4e5f\u5f88\u5efa\u8b70\u5927\u5bb6\u89c0\u770b\u4ee5\u4e0b\u9019\u4e00\u500b Youtube \u5f71\u7247\u3002\",\"breadcrumb\":{\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/myoceane.fr\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/myoceane.fr\/#website\",\"url\":\"https:\/\/myoceane.fr\/\",\"name\":\"M-Y-Oceane \u60f3\u65b9\u6d89\u6cd5\u3002\u91cf\u74f6\u5916\u7684\u5929\u7a7a\",\"description\":\"\u60f3\u65b9\u6d89\u6cd5, France, Taiwan, Health, Information Technology\",\"publisher\":{\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/myoceane.fr\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b\",\"name\":\"\u6ab8\u6aac\u7238\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/6cc678684664f8ad45a8d56a6630b183?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/6cc678684664f8ad45a8d56a6630b183?s=96&d=mm&r=g\",\"caption\":\"\u6ab8\u6aac\u7238\"},\"logo\":{\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/image\/\"},\"url\":\"https:\/\/myoceane.fr\/index.php\/author\/johnny5584767gmail-com\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane","description":"Join \u662f\u4e00\u500b\u5728\u95dc\u806f\u6027\u8cc7\u6599\u5eab\u88e1\u9762\u5f88\u5e38\u4f7f\u7528\u7684\u4e00\u500b\u904b\u7b97\u5143\uff0c\u5728\u5927\u6578\u64da\u8cc7\u6599\u5eab\u6162\u6162\u666e\u53ca\u7684\u4eca\u5929\uff0cJoin \u9084\u662f\u4e00\u500b\u5e6b\u52a9\u6211\u5011\u4e86\u89e3\u8cc7\u6599\u95dc\u4fc2\u4e0d\u53ef\u6216\u7f3a\u7684\u89d2\u8272\uff0c\u4eca\u5929\u60f3\u8981\u8a0e\u8ad6\u7684\u662f\u5728 Spark \u88e1\u9762 Join \u80cc\u5f8c\u57f7\u884c\u7684\u904b\u7b97\u539f\u7406\uff0c\u7b46\u8005\u5728\u57f7\u884c Spark \u5de5\u4f5c\u7684\u6642\u5019\uff0c\u6709\u6642\u5019\u9700\u8981\u512a\u5316\u8cc7\u6599\u7684\u904b\u7b97\u904e\u7a0b\u4ee5\u964d\u4f4e\u904b\u7b97\u6240\u9700\u8981\u7684\u6642\u9593\uff0c\u672c\u7bc7\u7684\u8cc7\u6599\u4f86\u6e90\u53ef\u4ee5\u53c3\u8003\u9023\u7d50\uff0c\u53e6\u5916\u7b46\u8005\u4e5f\u5f88\u5efa\u8b70\u5927\u5bb6\u89c0\u770b\u4ee5\u4e0b\u9019\u4e00\u500b Youtube \u5f71\u7247\u3002","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/myoceane.fr\/index.php\/bigdata-\u5927\u6578\u64da\u4e2d\u7684-join\/","og_locale":"en_US","og_type":"article","og_title":"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane","og_description":"Join \u662f\u4e00\u500b\u5728\u95dc\u806f\u6027\u8cc7\u6599\u5eab\u88e1\u9762\u5f88\u5e38\u4f7f\u7528\u7684\u4e00\u500b\u904b\u7b97\u5143\uff0c\u5728\u5927\u6578\u64da\u8cc7\u6599\u5eab\u6162\u6162\u666e\u53ca\u7684\u4eca\u5929\uff0cJoin \u9084\u662f\u4e00\u500b\u5e6b\u52a9\u6211\u5011\u4e86\u89e3\u8cc7\u6599\u95dc\u4fc2\u4e0d\u53ef\u6216\u7f3a\u7684\u89d2\u8272\uff0c\u4eca\u5929\u60f3\u8981\u8a0e\u8ad6\u7684\u662f\u5728 Spark \u88e1\u9762 Join \u80cc\u5f8c\u57f7\u884c\u7684\u904b\u7b97\u539f\u7406\uff0c\u7b46\u8005\u5728\u57f7\u884c Spark \u5de5\u4f5c\u7684\u6642\u5019\uff0c\u6709\u6642\u5019\u9700\u8981\u512a\u5316\u8cc7\u6599\u7684\u904b\u7b97\u904e\u7a0b\u4ee5\u964d\u4f4e\u904b\u7b97\u6240\u9700\u8981\u7684\u6642\u9593\uff0c\u672c\u7bc7\u7684\u8cc7\u6599\u4f86\u6e90\u53ef\u4ee5\u53c3\u8003\u9023\u7d50\uff0c\u53e6\u5916\u7b46\u8005\u4e5f\u5f88\u5efa\u8b70\u5927\u5bb6\u89c0\u770b\u4ee5\u4e0b\u9019\u4e00\u500b Youtube \u5f71\u7247\u3002","og_url":"https:\/\/myoceane.fr\/index.php\/bigdata-\u5927\u6578\u64da\u4e2d\u7684-join\/","og_site_name":"\u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane","article_published_time":"2019-12-25T10:39:08+00:00","article_modified_time":"2019-12-30T11:18:43+00:00","author":"\u6ab8\u6aac\u7238","twitter_card":"summary_large_image","twitter_misc":{"Written by":"\u6ab8\u6aac\u7238","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/#article","isPartOf":{"@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/"},"author":{"name":"\u6ab8\u6aac\u7238","@id":"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b"},"headline":"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join","datePublished":"2019-12-25T10:39:08+00:00","dateModified":"2019-12-30T11:18:43+00:00","mainEntityOfPage":{"@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/"},"wordCount":112,"commentCount":1,"publisher":{"@id":"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b"},"articleSection":["Big Data &amp; Machine Learning","IT Technology"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/","url":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/","name":"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane","isPartOf":{"@id":"https:\/\/myoceane.fr\/#website"},"datePublished":"2019-12-25T10:39:08+00:00","dateModified":"2019-12-30T11:18:43+00:00","description":"Join \u662f\u4e00\u500b\u5728\u95dc\u806f\u6027\u8cc7\u6599\u5eab\u88e1\u9762\u5f88\u5e38\u4f7f\u7528\u7684\u4e00\u500b\u904b\u7b97\u5143\uff0c\u5728\u5927\u6578\u64da\u8cc7\u6599\u5eab\u6162\u6162\u666e\u53ca\u7684\u4eca\u5929\uff0cJoin \u9084\u662f\u4e00\u500b\u5e6b\u52a9\u6211\u5011\u4e86\u89e3\u8cc7\u6599\u95dc\u4fc2\u4e0d\u53ef\u6216\u7f3a\u7684\u89d2\u8272\uff0c\u4eca\u5929\u60f3\u8981\u8a0e\u8ad6\u7684\u662f\u5728 Spark \u88e1\u9762 Join \u80cc\u5f8c\u57f7\u884c\u7684\u904b\u7b97\u539f\u7406\uff0c\u7b46\u8005\u5728\u57f7\u884c Spark \u5de5\u4f5c\u7684\u6642\u5019\uff0c\u6709\u6642\u5019\u9700\u8981\u512a\u5316\u8cc7\u6599\u7684\u904b\u7b97\u904e\u7a0b\u4ee5\u964d\u4f4e\u904b\u7b97\u6240\u9700\u8981\u7684\u6642\u9593\uff0c\u672c\u7bc7\u7684\u8cc7\u6599\u4f86\u6e90\u53ef\u4ee5\u53c3\u8003\u9023\u7d50\uff0c\u53e6\u5916\u7b46\u8005\u4e5f\u5f88\u5efa\u8b70\u5927\u5bb6\u89c0\u770b\u4ee5\u4e0b\u9019\u4e00\u500b Youtube \u5f71\u7247\u3002","breadcrumb":{"@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%a4%a7%e6%95%b8%e6%93%9a%e4%b8%ad%e7%9a%84-join\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/myoceane.fr\/"},{"@type":"ListItem","position":2,"name":"[BigData] \u5927\u6578\u64da\u4e2d\u7684 Join"}]},{"@type":"WebSite","@id":"https:\/\/myoceane.fr\/#website","url":"https:\/\/myoceane.fr\/","name":"M-Y-Oceane \u60f3\u65b9\u6d89\u6cd5\u3002\u91cf\u74f6\u5916\u7684\u5929\u7a7a","description":"\u60f3\u65b9\u6d89\u6cd5, France, Taiwan, Health, Information Technology","publisher":{"@id":"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/myoceane.fr\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":["Person","Organization"],"@id":"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b","name":"\u6ab8\u6aac\u7238","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/myoceane.fr\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/6cc678684664f8ad45a8d56a6630b183?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/6cc678684664f8ad45a8d56a6630b183?s=96&d=mm&r=g","caption":"\u6ab8\u6aac\u7238"},"logo":{"@id":"https:\/\/myoceane.fr\/#\/schema\/person\/image\/"},"url":"https:\/\/myoceane.fr\/index.php\/author\/johnny5584767gmail-com\/"}]}},"amp_enabled":false,"_links":{"self":[{"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/posts\/2474","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/comments?post=2474"}],"version-history":[{"count":51,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/posts\/2474\/revisions"}],"predecessor-version":[{"id":2628,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/posts\/2474\/revisions\/2628"}],"wp:attachment":[{"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/media?parent=2474"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/categories?post=2474"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/tags?post=2474"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}