{"id":10066,"date":"2025-05-29T08:05:56","date_gmt":"2025-05-29T06:05:56","guid":{"rendered":"https:\/\/myoceane.fr\/?p=10066"},"modified":"2025-06-06T08:48:34","modified_gmt":"2025-06-06T06:48:34","slug":"bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba","status":"publish","type":"post","link":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/","title":{"rendered":"[BigData] Exporting MSSQL Data in Parallel with Spark"},"content":{"rendered":"<div id=\"fb-root\"><\/div>\n\n<p>From Spark 2.1.0 onward, JDBC (Java Database Connectivity) can serve as one of the input sources of a Spark application. During digital transformation we often need to import an enterprise data warehouse into a data lakehouse, and the simplest way to avoid extra data-conversion work is to move the RDBMS data straight into the data lake. This post records the process of moving a large volume of MSSQL data into an on-premises data lakehouse in a Spark 3.3.0 environment, along with the troubleshooting involved.<\/p>\n<h4>Loading<\/h4>\n<p>For details on using JDBC with Spark, see the <a href=\"https:\/\/spark.apache.org\/docs\/3.5.2\/sql-data-sources-jdbc.html\">official documentation<\/a>. With Spark, the following Scala code loads the given JDBC data source into a DataFrame (the lower-level RDD counterpart is <a 
href=\"https:\/\/spark.apache.org\/docs\/3.5.2\/api\/scala\/org\/apache\/spark\/rdd\/JdbcRDD.html\">JdbcRDD<\/a>).<\/p>\n<pre class=\"lang:scala\">val df = spark.read.format(\"com.microsoft.sqlserver.jdbc.spark\")\n    .option(\"url\", url)\n    .option(\"dbtable\", dbTable)\n    .option(\"username\", username)\n    .option(\"password\", password)\n    .load()\ndf.show()<\/pre>\n<p>At this point you can see that Spark starts only one executor to read the data, since without partitioning options a JDBC read runs as a single partition over one connection.<\/p>\n<h4>Dependency<\/h4>\n<p style=\"text-align: justify;\">Using com.microsoft.sqlserver.jdbc.spark to connect to MSSQL Server requires the spark-mssql-connector JAR. Because <a href=\"https:\/\/github.com\/microsoft\/sql-spark-connector\">this repo<\/a> is now a public archive, it appears to support Spark only up to 3.4, with no support from Spark 3.5 onward. In addition, the public Maven repository lists only version <a href=\"https:\/\/central.sonatype.com\/artifact\/com.microsoft.azure\/spark-mssql-connector_2.12?smo=true\">1.3.0_BETA<\/a>, so take extra care when integrating it. When I tried it in a Spark 3.5 environment, I hit the following error; \"No suitable driver\" generally means that no registered JDBC driver accepted the connection URL, for example because the driver JAR was not on the classpath:<\/p>\n<pre class=\"lang:bash\">Py4JJavaError: An error occurred while calling o101.load.\n: java.sql.SQLException: No suitable driver\n\u2003\u2003at java.sql.DriverManager.getDriver(DriverManager.java:315)\n\u2003\u2003at org.apache.spark.sql.execution.datasources.jdbc.JDBCOptions.$anonfun$driverClass$2(JDBCOptions.scala:109)\n\u2003\u2003at scala.Option.getOrElse(Option.scala:189)\n\u2003\u2003at 
org.apache.spark.sql.execution.datasources.jdbc.JDBCOptions.&lt;init&gt;(JDBCOptions.scala:109)\n\u2003\u2003at org.apache.spark.sql.execution.datasources.jdbc.JDBCOptions.&lt;init&gt;(JDBCOptions.scala:41)\n\u2003\u2003at org.apache.spark.sql.execution.datasources.jdbc.JdbcRelationProvider.createRelation(JdbcRelationProvider.scala:34)\n\u2003\u2003at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:346)\n\u2003\u2003at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:229)\n\u2003\u2003at org.apache.spark.sql.DataFrameReader.$anonfun$load$2(DataFrameReader.scala:211)\n\u2003\u2003at scala.Option.getOrElse(Option.scala:189)\n\u2003\u2003at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:211)\n\u2003\u2003at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:172)\n\u2003\u2003at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)\n\u2003\u2003at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)\n\u2003\u2003at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)\n\u2003\u2003at java.lang.reflect.Method.invoke(Method.java:498)\n\u2003\u2003at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:244)\n\u2003\u2003at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:374)\n\u2003\u2003at py4j.Gateway.invoke(Gateway.java:282)\n\u2003\u2003at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:132)\n\u2003\u2003at py4j.commands.CallCommand.execute(CallCommand.java:79)\n\u2003\u2003at py4j.ClientServerConnection.waitForCommands(ClientServerConnection.java:182)\n\u2003\u2003at py4j.ClientServerConnection.run(ClientServerConnection.java:106)\n\u2003\u2003at java.lang.Thread.run(Thread.java:750)<\/pre>\n<h4>Parallel Loading<\/h4>\n<p>To load data over several connections at once, use the numPartitions 
option or the partitionColumn, lowerBound, and upperBound options to get parallel loading, as follows:<\/p>\n<pre class=\"lang:bash\">val df = spark.read.format(\"com.microsoft.sqlserver.jdbc.spark\")\n    .option(\"url\", jdbcUrl)\n    .option(\"dbtable\", dbTable)\n    .option(\"numPartitions\", 20)\n    .load()<\/pre>\n<pre class=\"lang:bash\">val df = spark.read.format(\"com.microsoft.sqlserver.jdbc.spark\")\n    .option(\"url\", jdbcUrl)\n    .option(\"dbtable\", dbTable)\n    .option(\"partitionColumn\", \"c1\")\n    .option(\"lowerBound\", \"1\")\n    .option(\"upperBound\", \"100\")\n    .option(\"numPartitions\", \"3\")\n    .load()<\/pre>\n<p style=\"text-align: justify;\">You can set numPartitions directly or use partitionColumn to define how the data is split; the partitionColumn column must be of type numeric, date, or timestamp. Per the Spark JDBC documentation, numPartitions is only an upper bound on the number of partitions (and of concurrent JDBC connections), and a read is split only when partitionColumn, lowerBound, and upperBound are all specified, so after running, remember to check whether the number of partitions really matches what you configured.<\/p>\n<pre class=\"lang:bash\">df.rdd.getNumPartitions  \/\/ Scala; in PySpark: df.rdd.getNumPartitions()<\/pre>\n<p>For further reading, see: <a href=\"https:\/\/www.waitingforcode.com\/apache-spark-sql\/partitioning-rdbms-data-spark-sql-jdbc\/read\">Partitioning RDBMS data in Spark SQL<\/a><\/p>\n\n\n\n<h4>Miscellaneous<\/h4>\n<p>While implementing this, I also occasionally hit the warning below. Since the data source is MSSQL, my guess is that there were not enough connections for Spark's parallel load, so the warning was raised. The SQL Server message in the log reads, in English, \"The query processor could not start the necessary thread resources for parallel query execution\", which suggests the server ran short of worker threads while handling the concurrent queries.<\/p>\n<pre class=\"lang:bash\">04:29:05.762 INFO  HiveMetaStore - 0: get_database: default\n04:29:05.762 INFO  audit - 
ugi=root\u2003\u2003ip=unknown-ip-addr\u2003\u2003cmd=get_database: default\u2003\u2003\n04:29:05.841 INFO  HiveMetaStore - 0: get_table : db=default tbl=test\n04:29:05.841 INFO  audit - ugi=root\u2003\u2003ip=unknown-ip-addr\u2003\u2003cmd=get_table : db=default tbl=test\u2003\u2003\n04:29:05.913 INFO  HiveMetaStore - 0: get_database: default\n04:29:05.913 INFO  audit - ugi=root\u2003\u2003ip=unknown-ip-addr\u2003\u2003cmd=get_database: default\u2003\u2003\n04:29:05.987 INFO  HiveMetaStore - 0: get_table : db=default tbl=test\n04:29:05.987 INFO  audit - ugi=root\u2003\u2003ip=unknown-ip-addr\u2003\u2003cmd=get_table : db=default tbl=test\u2003\u2003\n04:29:06.048 INFO  HiveMetaStore - 0: get_table : db=default tbl=test\n04:29:06.048 INFO  audit - ugi=root\u2003\u2003ip=unknown-ip-addr\u2003\u2003cmd=get_table : db=default tbl=test\u2003\u2003\n04:29:06.107 INFO  HiveMetaStore - 0: get_database: default\n04:29:06.107 INFO  audit - ugi=root\u2003\u2003ip=unknown-ip-addr\u2003\u2003cmd=get_database: default\u2003\u2003\n04:29:06.184 INFO  HiveMetaStore - 0: get_database: default\n04:29:06.184 INFO  audit - ugi=root\u2003\u2003ip=unknown-ip-addr\u2003\u2003cmd=get_database: default\u2003\u2003\n04:36:09.210 WARN  TaskSetManager - Lost task 17.0 in stage 25.0 (TID 589) (192.168.14.238 executor 229): com.microsoft.sqlserver.jdbc.SQLServerException: \u67e5\u8a62\u8655\u7406\u5668\u7121\u6cd5\u70ba\u5e73\u884c\u67e5\u8a62\u7684\u57f7\u884c\u555f\u52d5\u5fc5\u8981\u7684\u57f7\u884c\u7dd2\u8cc7\u6e90\u3002\n\u2003\u2003at com.microsoft.sqlserver.jdbc.SQLServerException.makeFromDatabaseError(SQLServerException.java:259)\n\u2003\u2003at com.microsoft.sqlserver.jdbc.SQLServerResultSet$FetchBuffer.nextRow(SQLServerResultSet.java:6388)\n\u2003\u2003at com.microsoft.sqlserver.jdbc.SQLServerResultSet.fetchBufferNext(SQLServerResultSet.java:1647)\n\u2003\u2003at com.microsoft.sqlserver.jdbc.SQLServerResultSet.next(SQLServerResultSet.java:988)\n\u2003\u2003at 
org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$$anon$1.getNext(JdbcUtils.scala:344)\n\u2003\u2003at org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$$anon$1.getNext(JdbcUtils.scala:330)\n\u2003\u2003at org.apache.spark.util.NextIterator.hasNext(NextIterator.scala:73)\n\u2003\u2003at org.apache.spark.InterruptibleIterator.hasNext(InterruptibleIterator.scala:37)\n\u2003\u2003at org.apache.spark.util.CompletionIterator.hasNext(CompletionIterator.scala:31)\n\u2003\u2003at org.apache.spark.sql.catalyst.expressions.GeneratedClass$GeneratedIteratorForCodegenStage1.processNext(Unknown Source)\n\u2003\u2003at org.apache.spark.sql.execution.BufferedRowIterator.hasNext(BufferedRowIterator.java:43)\n\u2003\u2003at org.apache.spark.sql.execution.WholeStageCodegenExec$$anon$1.hasNext(WholeStageCodegenExec.scala:760)\n\u2003\u2003at org.apache.spark.sql.execution.UnsafeExternalRowSorter.sort(UnsafeExternalRowSorter.java:225)\n\u2003\u2003at org.apache.spark.sql.execution.exchange.ShuffleExchangeExec$.$anonfun$prepareShuffleDependency$10(ShuffleExchangeExec.scala:369)\n\u2003\u2003at org.apache.spark.rdd.RDD.$anonfun$mapPartitionsInternal$2(RDD.scala:890)\n\u2003\u2003at org.apache.spark.rdd.RDD.$anonfun$mapPartitionsInternal$2$adapted(RDD.scala:890)\n\u2003\u2003at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)\n\u2003\u2003at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:365)\n\u2003\u2003at org.apache.spark.rdd.RDD.iterator(RDD.scala:329)\n\u2003\u2003at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)\n\u2003\u2003at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:365)\n\u2003\u2003at org.apache.spark.rdd.RDD.iterator(RDD.scala:329)\n\u2003\u2003at org.apache.spark.shuffle.ShuffleWriteProcessor.write(ShuffleWriteProcessor.scala:59)\n\u2003\u2003at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:99)\n\u2003\u2003at 
org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:52)\n\u2003\u2003at org.apache.spark.scheduler.Task.run(Task.scala:136)\n\u2003\u2003at org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$3(Executor.scala:548)\n\u2003\u2003at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1504)\n\u2003\u2003at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:551)\n\u2003\u2003at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)\n\u2003\u2003at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)\n\u2003\u2003at java.lang.Thread.run(Thread.java:750)\n\n[Stage 25:======================================================&gt; (38 + 1) \/ 39]<\/pre>\n","protected":false},"excerpt":{"rendered":"<p>Spark 2.1.0 \u4e4b\u5f8c\uff0cJDBC (Java Database Connectivity) \u88ab\u5f15\u5165\u4f5c\u70ba\u4e00\u500b Spark \u61c9\u7528\u7684\u5176\u4e2d\u4e00\u7a2e\u8f38\u5165\uff0c\u5728\u6578\u4f4d\u8f49\u578b\u7684\u904e\u7a0b\u4e2d\u6211\u5011\u5f88\u5e38\u9700\u8981\u5c07\u4f01\u696d\u4e2d\u7684\u8cc7\u6599\u5009\u5132\u532f\u5165\u8cc7\u6599\u6e56\u5eab\uff0c\u70ba\u4e86\u7701\u53bb\u8cc7\u6599\u8f49\u63db\u7684\u529f\u592b\uff0c\u6700\u597d\u7684\u65b9\u6cd5\u5c31\u662f\u76f4\u63a5\u5c07 RDBMS \u7684\u8cc7\u6599\u76f4\u63a5\u8f49\u9032 Datalake \u88e1\u9762\uff0c\u672c\u7bc7\u5e0c\u671b\u7d00\u9304\u5728 Spark 3.3.0 \u74b0\u5883\u4e2d\u5c07 MSSQL \u7684\u5de8\u91cf\u8cc7\u6599\u8f49\u79fb\u5230\u5730\u7aef\u7684\u8cc7\u6599\u6e56\u5eab\u4e2d\u7684\u904e\u7a0b\u8207 TroubleShooting\u3002<\/p>\n","protected":false},"author":1,"featured_media":5486,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9],"tags":[1854,1853,152],"class_list":["post-10066","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-bigdata-ml","tag-jdbc","tag-mssql","tag-spark"],"yoast_head":"<!-- 
This site is optimized with the Yoast SEO plugin v24.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL \u6a94\u6848\u5e73\u884c\u532f\u51fa - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/myoceane.fr\/index.php\/bigdata-\u5be6\u4f5c-spark-\u5c07-mssql-\u6a94\u6848\u5e73\u884c\u532f\u51fa\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL \u6a94\u6848\u5e73\u884c\u532f\u51fa - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane\" \/>\n<meta property=\"og:description\" content=\"Spark 2.1.0 \u4e4b\u5f8c\uff0cJDBC (Java Database Connectivity) \u88ab\u5f15\u5165\u4f5c\u70ba\u4e00\u500b Spark \u61c9\u7528\u7684\u5176\u4e2d\u4e00\u7a2e\u8f38\u5165\uff0c\u5728\u6578\u4f4d\u8f49\u578b\u7684\u904e\u7a0b\u4e2d\u6211\u5011\u5f88\u5e38\u9700\u8981\u5c07\u4f01\u696d\u4e2d\u7684\u8cc7\u6599\u5009\u5132\u532f\u5165\u8cc7\u6599\u6e56\u5eab\uff0c\u70ba\u4e86\u7701\u53bb\u8cc7\u6599\u8f49\u63db\u7684\u529f\u592b\uff0c\u6700\u597d\u7684\u65b9\u6cd5\u5c31\u662f\u76f4\u63a5\u5c07 RDBMS \u7684\u8cc7\u6599\u76f4\u63a5\u8f49\u9032 Datalake \u88e1\u9762\uff0c\u672c\u7bc7\u5e0c\u671b\u7d00\u9304\u5728 Spark 3.3.0 \u74b0\u5883\u4e2d\u5c07 MSSQL \u7684\u5de8\u91cf\u8cc7\u6599\u8f49\u79fb\u5230\u5730\u7aef\u7684\u8cc7\u6599\u6e56\u5eab\u4e2d\u7684\u904e\u7a0b\u8207 TroubleShooting\u3002\" \/>\n<meta property=\"og:url\" content=\"https:\/\/myoceane.fr\/index.php\/bigdata-\u5be6\u4f5c-spark-\u5c07-mssql-\u6a94\u6848\u5e73\u884c\u532f\u51fa\/\" \/>\n<meta property=\"og:site_name\" content=\"\u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane\" \/>\n<meta 
property=\"article:published_time\" content=\"2025-05-29T06:05:56+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-06T06:48:34+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark_logo.svg_.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"625\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"\u6ab8\u6aac\u7238\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"\u6ab8\u6aac\u7238\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/\"},\"author\":{\"name\":\"\u6ab8\u6aac\u7238\",\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b\"},\"headline\":\"[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL 
\u6a94\u6848\u5e73\u884c\u532f\u51fa\",\"datePublished\":\"2025-05-29T06:05:56+00:00\",\"dateModified\":\"2025-06-06T06:48:34+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/\"},\"wordCount\":69,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b\"},\"image\":{\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark_logo.svg_.png\",\"keywords\":[\"JDBC\",\"MSSQL\",\"Spark\"],\"articleSection\":[\"Big Data &amp; Machine Learning\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/\",\"url\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/\",\"name\":\"[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL \u6a94\u6848\u5e73\u884c\u532f\u51fa - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a 
M-Y-Oceane\",\"isPartOf\":{\"@id\":\"https:\/\/myoceane.fr\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark_logo.svg_.png\",\"datePublished\":\"2025-05-29T06:05:56+00:00\",\"dateModified\":\"2025-06-06T06:48:34+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#primaryimage\",\"url\":\"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark_logo.svg_.png\",\"contentUrl\":\"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark_logo.svg_.png\",\"width\":1200,\"height\":625,\"caption\":\"apache-spark\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/myoceane.fr\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL 
\u6a94\u6848\u5e73\u884c\u532f\u51fa\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/myoceane.fr\/#website\",\"url\":\"https:\/\/myoceane.fr\/\",\"name\":\"M-Y-Oceane \u60f3\u65b9\u6d89\u6cd5\u3002\u91cf\u74f6\u5916\u7684\u5929\u7a7a\",\"description\":\"\u60f3\u65b9\u6d89\u6cd5, France, Taiwan, Health, Information Technology\",\"publisher\":{\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/myoceane.fr\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b\",\"name\":\"\u6ab8\u6aac\u7238\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/6cc678684664f8ad45a8d56a6630b183?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/6cc678684664f8ad45a8d56a6630b183?s=96&d=mm&r=g\",\"caption\":\"\u6ab8\u6aac\u7238\"},\"logo\":{\"@id\":\"https:\/\/myoceane.fr\/#\/schema\/person\/image\/\"},\"url\":\"https:\/\/myoceane.fr\/index.php\/author\/johnny5584767gmail-com\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL \u6a94\u6848\u5e73\u884c\u532f\u51fa - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/myoceane.fr\/index.php\/bigdata-\u5be6\u4f5c-spark-\u5c07-mssql-\u6a94\u6848\u5e73\u884c\u532f\u51fa\/","og_locale":"en_US","og_type":"article","og_title":"[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL \u6a94\u6848\u5e73\u884c\u532f\u51fa - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane","og_description":"Spark 2.1.0 \u4e4b\u5f8c\uff0cJDBC (Java Database Connectivity) \u88ab\u5f15\u5165\u4f5c\u70ba\u4e00\u500b Spark \u61c9\u7528\u7684\u5176\u4e2d\u4e00\u7a2e\u8f38\u5165\uff0c\u5728\u6578\u4f4d\u8f49\u578b\u7684\u904e\u7a0b\u4e2d\u6211\u5011\u5f88\u5e38\u9700\u8981\u5c07\u4f01\u696d\u4e2d\u7684\u8cc7\u6599\u5009\u5132\u532f\u5165\u8cc7\u6599\u6e56\u5eab\uff0c\u70ba\u4e86\u7701\u53bb\u8cc7\u6599\u8f49\u63db\u7684\u529f\u592b\uff0c\u6700\u597d\u7684\u65b9\u6cd5\u5c31\u662f\u76f4\u63a5\u5c07 RDBMS \u7684\u8cc7\u6599\u76f4\u63a5\u8f49\u9032 Datalake \u88e1\u9762\uff0c\u672c\u7bc7\u5e0c\u671b\u7d00\u9304\u5728 Spark 3.3.0 \u74b0\u5883\u4e2d\u5c07 MSSQL \u7684\u5de8\u91cf\u8cc7\u6599\u8f49\u79fb\u5230\u5730\u7aef\u7684\u8cc7\u6599\u6e56\u5eab\u4e2d\u7684\u904e\u7a0b\u8207 TroubleShooting\u3002","og_url":"https:\/\/myoceane.fr\/index.php\/bigdata-\u5be6\u4f5c-spark-\u5c07-mssql-\u6a94\u6848\u5e73\u884c\u532f\u51fa\/","og_site_name":"\u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a 
M-Y-Oceane","article_published_time":"2025-05-29T06:05:56+00:00","article_modified_time":"2025-06-06T06:48:34+00:00","og_image":[{"width":1200,"height":625,"url":"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark_logo.svg_.png","type":"image\/png"}],"author":"\u6ab8\u6aac\u7238","twitter_card":"summary_large_image","twitter_misc":{"Written by":"\u6ab8\u6aac\u7238","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#article","isPartOf":{"@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/"},"author":{"name":"\u6ab8\u6aac\u7238","@id":"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b"},"headline":"[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL \u6a94\u6848\u5e73\u884c\u532f\u51fa","datePublished":"2025-05-29T06:05:56+00:00","dateModified":"2025-06-06T06:48:34+00:00","mainEntityOfPage":{"@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/"},"wordCount":69,"commentCount":0,"publisher":{"@id":"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b"},"image":{"@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#primaryimage"},"thumbnailUrl":"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark_logo.svg_.png","keywords":["JDBC","MSSQL","Spark"],"articleSection":["Big Data &amp; Machine 
Learning"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/","url":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/","name":"[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL \u6a94\u6848\u5e73\u884c\u532f\u51fa - \u60f3\u65b9\u6d89\u6cd5 - \u91cf\u74f6\u5916\u7684\u5929\u7a7a M-Y-Oceane","isPartOf":{"@id":"https:\/\/myoceane.fr\/#website"},"primaryImageOfPage":{"@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#primaryimage"},"image":{"@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#primaryimage"},"thumbnailUrl":"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark_logo.svg_.png","datePublished":"2025-05-29T06:05:56+00:00","dateModified":"2025-06-06T06:48:34+00:00","breadcrumb":{"@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#primaryimage","url":"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark
_logo.svg_.png","contentUrl":"https:\/\/myoceane.fr\/wp-content\/uploads\/2020\/06\/Apache_Spark_logo.svg_.png","width":1200,"height":625,"caption":"apache-spark"},{"@type":"BreadcrumbList","@id":"https:\/\/myoceane.fr\/index.php\/bigdata-%e5%af%a6%e4%bd%9c-spark-%e5%b0%87-mssql-%e6%aa%94%e6%a1%88%e5%b9%b3%e8%a1%8c%e5%8c%af%e5%87%ba\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/myoceane.fr\/"},{"@type":"ListItem","position":2,"name":"[BigData] \u5be6\u4f5c Spark \u5c07 MSSQL \u6a94\u6848\u5e73\u884c\u532f\u51fa"}]},{"@type":"WebSite","@id":"https:\/\/myoceane.fr\/#website","url":"https:\/\/myoceane.fr\/","name":"M-Y-Oceane \u60f3\u65b9\u6d89\u6cd5\u3002\u91cf\u74f6\u5916\u7684\u5929\u7a7a","description":"\u60f3\u65b9\u6d89\u6cd5, France, Taiwan, Health, Information Technology","publisher":{"@id":"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/myoceane.fr\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":["Person","Organization"],"@id":"https:\/\/myoceane.fr\/#\/schema\/person\/4a4552fb8c27693083d465e12db7658b","name":"\u6ab8\u6aac\u7238","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/myoceane.fr\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/6cc678684664f8ad45a8d56a6630b183?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/6cc678684664f8ad45a8d56a6630b183?s=96&d=mm&r=g","caption":"\u6ab8\u6aac\u7238"},"logo":{"@id":"https:\/\/myoceane.fr\/#\/schema\/person\/image\/"},"url":"https:\/\/myoceane.fr\/index.php\/author\/johnny5584767gmail-com\/"}]}},"amp_enabled":false,"_links":{"self":[{"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/posts\/10066","targetHints":{"allow":["GET"]}}],"collection":[{"hre
f":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/comments?post=10066"}],"version-history":[{"count":16,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/posts\/10066\/revisions"}],"predecessor-version":[{"id":10085,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/posts\/10066\/revisions\/10085"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/media\/5486"}],"wp:attachment":[{"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/media?parent=10066"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/categories?post=10066"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/myoceane.fr\/index.php\/wp-json\/wp\/v2\/tags?post=10066"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}