當(dāng)前位置：首頁 >

MongoDB Sharding分片配置

發(fā)布時間：2025/7/14 39 豆豆

生活随笔收集整理的這篇文章主要介紹了 MongoDB Sharding分片配置小編覺得挺不錯的,現(xiàn)在分享給大家,幫大家做個參考.

Ps:mongod是mongodb實(shí)例，mongos被默認(rèn)為為mongodb sharding的路由實(shí)例。本文使用的mongodb版本為3.2.9，因此參考網(wǎng)址為：https://docs.mongodb.com/v3.2/sharding/ 此外最后幾個部分還引用了https://yq.aliyun.com/articles/60096中的一些問題描述及解決方案。 一、Sharding集群簡介
1.數(shù)據(jù)分片（Shards）用來保存數(shù)據(jù)，保證數(shù)據(jù)的高可用性和一致性。可以是一個單獨(dú)的mongod實(shí)例，也可以是一個副本集。在生產(chǎn)環(huán)境下Shard一般是一個Replica Set，以防止該數(shù)據(jù)片的單點(diǎn)故障。可以將所有shard的副本集放在一個服務(wù)器多個mongodb實(shí)例中。 2.查詢路由（Query Routers）路由就是mongos的實(shí)例，客戶端直接連接mongos，由mongos把讀寫請求路由到指定的Shard上去。一個Sharding集群，可以有一個mongos，也可以如上圖所示為每個App Server配置一個mongos以減輕路由壓力。注意這里的mongos并不要配置為rs，因?yàn)橹皇莻€路由，并不存儲數(shù)據(jù)，配置多個mongos的意思是配置多個單獨(dú)的mongos實(shí)例。 3.配置服務(wù)器（Config servers）保存集群的元數(shù)據(jù)（metadata），包含各個Shard的路由規(guī)則。3.2版本以后config server可以配置為replica set(CSRS),3.4以后config server必須配置為rs。 config server的rs不能有arbiter（3.2.9版本是這樣，其他版本未測試），生產(chǎn)上建議config server的rs至少要有3個副本集成員。 MongoDB是在collection級別實(shí)現(xiàn)的水平分片。 ? 二、分片鍵：Shard keys

shard key在sharding搭建完畢后是不能修改的，一個collection上只能有一個shard key。
shard key上必須有索引（可以是以shard key開頭的聯(lián)合索引），如果沒有mongodb會為shard key創(chuàng)建索引。如果是已經(jīng)存在的collection那么必須手動為shard key創(chuàng)建索引。
在sharding的collection中只有_id和shard key前綴的索引可以是unique index，其他索引只能是普通索引。如果一個普通key上有unique index那么你不能以其他key為shard key對collection進(jìn)行sharding。

shard key的選擇將會影響整個集群的效率，可擴(kuò)展性和性能。而且也會影響你所能選擇的分片策略。

關(guān)于shard key詳見：https://docs.mongodb.com/v3.2/core/sharding-shard-key/ 分片范圍是[shard_key_value_m,shard_key_value_n)，MongoDB把每個分片叫做一個shard，一部分shard key的集合叫做chunk，一個shard上可以有多個chunk也可以只有一個chunk，一般會有多個。 ? 三、Sharding的優(yōu)勢 1.讀寫方面： sharding將讀寫負(fù)載均勻到各個shard，且workload上限可以通過水平擴(kuò)展來增加。 2.擴(kuò)容方面：每個shard保存一部分?jǐn)?shù)據(jù)，可以通過增加shards來擴(kuò)容。 3.高可用方面：即便某個shard不可用了，整個集群也可以對外提供服務(wù)，只不過訪問down掉的shard會報"Connection refused"的錯誤。而且MongoDB3.2以后可以為每個shard都配置副本集（replica set），這樣保證最大程度的高可用性。 ? 四、Sharding的劣勢 數(shù)據(jù)量較少時不建議使用sharding，畢竟讀寫都要經(jīng)過一層路由會有性能損耗，直接表現(xiàn)就是ips和qps會降低。 ? 五、使用Sharding前需要考慮的一些事情 1.sharding集群不支持一些常規(guī)的單實(shí)例方法，如group()，可以使用mapReduce()或者aggregate()中的group來替代，因此建議從一開始學(xué)習(xí)就直接使用aggregate(),這種寫法較為簡單明了，且統(tǒng)一化易于識別。 2.對于沒有用到shard key的查詢，路由進(jìn)行全集群廣播（broadcast operation），對每個shard都查一遍進(jìn)行scatter/gather，此時效率會很低。 3.生產(chǎn)上使用副本集或sharding時，要考慮到安全認(rèn)證的問題，除了開啟對外的auth賬戶認(rèn)證外，集群節(jié)點(diǎn)間最好指定keyfile啟動，這樣可以防止陌生節(jié)點(diǎn)隨意加入集群。 ? 六、Sharding策略選擇 1.hash sharding：https://docs.mongodb.com/v3.2/core/hashed-sharding/ 當(dāng)shard key總是單調(diào)遞增時hash sharding并不是一個很好的選擇，其查詢分發(fā)基本和broadcast operation一樣了，因?yàn)閔ash會把數(shù)據(jù)比較均勻的分布在各個shard上，但此時選擇ranged sharding也有缺點(diǎn)，因?yàn)閿?shù)據(jù)過度集中會導(dǎo)致數(shù)據(jù)集中于某個shard。 2.ranged sharding：https://docs.mongodb.com/v3.2/core/ranged-sharding/ 在shard key選取不正確的情況下，范圍分片會導(dǎo)致數(shù)據(jù)分布不均勻，也可能遭遇性能瓶頸，因此需要合理的選擇ranged shard key。 3.Tag aware sharding：https://docs.mongodb.com/v3.2/core/tag-aware-sharding/ 原理如下：

sh.addShardTag() 給shard設(shè)置標(biāo)簽A
sh.addTagRange() 給集合的某個chunk范圍設(shè)置標(biāo)簽A，最終MongoDB會保證設(shè)置標(biāo)簽 A 的chunk范圍（或該范圍的超集）分布設(shè)置了標(biāo)簽 A 的 shard 上。

Tag aware sharding可應(yīng)用在如下場景：將部署在不同機(jī)房的shard設(shè)置機(jī)房標(biāo)簽，將不同chunk范圍的數(shù)據(jù)分布到指定的機(jī)房將服務(wù)能力不通的shard設(shè)置服務(wù)等級標(biāo)簽，將更多的chunk分散到服務(wù)能力更強(qiáng)的shard上去。 ? 使用 Tag aware sharding 需要注意是,chunk分配到對應(yīng)標(biāo)簽的shard上不是立即完成，而是在不斷insert、update后觸發(fā)split、moveChunk后逐步完成的，并且需要保證balancer是開啟的。所以你可能會觀察到，在設(shè)置了tag range后一段時間后，寫入仍然沒有分布到tag相同的shard上去。 ? 七、Sharding搭建步驟: 關(guān)于sharding的操作方法參考：https://docs.mongodb.com/v3.2/reference/method/js-sharding/ 環(huán)境說明： MongoDB版本：3.2.9 節(jié)點(diǎn)：192.168.20.70/71/72 架構(gòu)說明： 70：包含mongos、config server（master）、3個shards(master) 71：包含config server（slave）、3個shards(slave) 72：包含3個shards(arbiter) --網(wǎng)上很多資料說config server必須是奇數(shù)個，但至少在本次搭建的3.2.9版本中2個也是可以的。 1.配置config server --master的mongo.conf(192.168.20.70) directoryperdb=true replSet=config configsvr=true logpath=/home/mongod/config_master/mongod.log logappend=true fork=true port=27018 dbpath=/home/mongod/config_master pidfilepath=/home/mongod/config_master/mongod.pid --slave的mongo.conf(192.168.20.71) directoryperdb=true replSet=config configsvr=true logpath=/home/mongod/config_slave/mongod.log logappend=true fork=true port=27018 dbpath=/home/mongod/config_slave pidfilepath=/home/mongod/config_slave/mongod.pid 然后啟動并配置config server的rs(replica set): mongod -f /home/mongod/config_master/mongo.conf mongod -f /home/mongod/config_slave/mongo.conf use admin cfg={_id:"config",members:[{_id:0,host:'192.168.20.70:27018',priority:2}, {_id:1,host:'192.168.20.71:27018',priority:1}]}; rs.initiate(cfg) 2.配置shards 本例中配置了3個shards，分別使用70服務(wù)器的27017,27020,27021端口，他們的slave和arbiter分別使用71和72服務(wù)器上的相同端口。 --shard1的master、slave、arbiter的配置文件（分別在70、71、72上） --master: directoryperdb=true replSet=shard1 shardsvr = true logpath=/home/mongod/shard1_master/mongod.log logappend=true fork=true port=27017 dbpath=/home/mongod/shard1_master pidfilepath=/home/mongod/shard1_master/mongod.pid --slave: directoryperdb=true replSet=shard1 shardsvr = true logpath=/home/mongod/shard1_slave/mongod.log logappend=true fork=true port=27017 dbpath=/home/mongod/shard1_slave pidfilepath=/home/mongod/shard1_slave/mongod.pid --arbiter: directoryperdb=true replSet=shard1 shardsvr = true logpath=/home/mongod/shard1_arbiter/mongod.log logappend=true fork=true port=27017 dbpath=/home/mongod/shard1_arbiter pidfilepath=/home/mongod/shard1_arbiter/mongod.pid shard2和shard3的配置文件與shard1基本一致，只需要把相應(yīng)的replSet設(shè)為shard2\shard3,相應(yīng)的目錄修改為shard2\shard3,相應(yīng)的端口修改為27020/27021即可。建好相應(yīng)的dbpath目錄后，啟動并為每個shard配置replica set，步驟如下： use admin cfg={_id:"shard1",members:[{_id:0,host:'192.168.20.70:27017',priority:2}, {_id:1,host:'192.168.20.71:27017',priority:1},{_id:2,host:'192.168.20.72:27017',arbiterOnly:true}]}; rs.initiate(cfg) shard2和shard3的配置步驟一樣，只需要把shard1修改為shard2/shard3,把端口修改為27020/27021即可。 3.完成config server和shards的rs配置后，就可以配置路由服務(wù)器了，路由服務(wù)器的官方名稱是mongos，我們這里也以mongos稱呼。 本例中只配置一個mongos，方法如下： --注意：dbpath、directoryperdb等參數(shù)是不能出現(xiàn)在mongos的配置文件中的，簡單起見只配置如下參數(shù)即可： configdb = config/192.168.20.70:27018,192.168.20.71:27018 --這里的config是config server副本集的名稱，后接config server的2個副本集節(jié)點(diǎn)。 logpath=/home/mongod/mongos/mongod.log logappend=true fork=true port=27019 pidfilepath=/home/mongod/mongos/mongod.pid 然后啟動mongos，注意mongos的啟動是與其他類型的mongo實(shí)例不一樣的：（用的mongos而不是mongod命令） mongos -f /home/mongod/mongos/mongo.conf 4.至此完成了所有服務(wù)器的配置，接下來開始配置具體collection的分片策略。 登錄mongos服務(wù)器： mongo --port=27019 use admin sh.addShard("shard1/192.168.20.70:27017,192.168.20.71:27017,192.168.20.72:27017"); sh.addShard("shard2/192.168.20.70:27020,192.168.20.71:27020,192.168.20.72:27020"); sh.addShard("shard3/192.168.20.70:27021,192.168.20.71:27021,192.168.20.72:27021"); 然后在mongos上為具體的數(shù)據(jù)庫配置sharding: sh.enableSharding("test") --允許test數(shù)據(jù)庫進(jìn)行sharding sh.shardCollection("test.t",{id:"hashed"}) --對test.t集合以id列為shard key進(jìn)行hashed sharding 通過db.t.getIndexes()可以看到自動為id列創(chuàng)建了索引。 5.hashed分片驗(yàn)證 在第4步中針對test的t集合進(jìn)行了分片配置，因此這里向t插入1000條數(shù)據(jù)做測試： mongo --port=27019 --27019是mongos的端口號 use test for(i=1,i<=1000,i++){db.t.insert({id:i,name:"Leo"})} 在3個shard的primary上使用db.t.find().count()會發(fā)現(xiàn)1000條數(shù)據(jù)近似均勻的分布到了3個shard上。使用db.t.stats()查看分片結(jié)果，使用sh.status()查看本庫內(nèi)所有集合的分片信息。 6.其他分片方式 sh.shardCollection("test.t",{id:1}) --對test.t集合以id列為shard key進(jìn)行ranged sharding ranged分片直接使用{id:1}方式指定即可，分片的chunk由mongos自主決定，例如在ranged分片集合中插入1000條數(shù)據(jù)，其結(jié)果如下： for(i=1;i<=1000;i++){db.t.insert({id:i,name:"Leo"})}
--sh.status()的相關(guān)結(jié)果： test.t shard key: { "id" : 1 } unique: false balancing: true chunks: shard1 1 shard2 1 shard3 1 { "id" : { "$minKey" : 1 } } -->> { "id" : 2 } on : shard1 Timestamp(2, 0) { "id" : 2 } -->> { "id" : 22 } on : shard3 Timestamp(3, 0) { "id" : 22 } -->> { "id" : { "$maxKey" : 1 } } on : shard2 Timestamp(3, 1) 從sh.status的結(jié)果可以看到id為[1,2)的被分配至shard1，[2,22)被分配至shard2，其他的全部被分配至shard3,分布極其不均勻。由于默認(rèn)的ranged sharding策略會導(dǎo)致自增shard key分布及其不均勻，我們需要在定時的使用sh.splitAt()方法來為分片指定分片chunk大小： sh.splitAt("test.t",{id:500}) sh.splitAt("test.t",{id:1000}) sh.splitAt("test.t",{id:1500}) sh.splitAt("test.t",{id:2000}) for(i=1;i<=3000;i++){db.t.insert({id:i,name:"Leo"})} --sh.status()顯示的分片結(jié)果如下： test.t shard key: { "id" : 1 } unique: false balancing: true chunks: shard1 2 --shard2上有2個chunks，分別是[1500,2000]和[2000,$maxKey) shard2 2 shard3 1 { "id" : { "$minKey" : 1 } } -->> { "id" : 500 } on : shard1 Timestamp(2, 0) { "id" : 500 } -->> { "id" : 1000 } on : shard3 Timestamp(3, 0) { "id" : 1000 } -->> { "id" : 1500 } on : shard1 Timestamp(4, 0) { "id" : 1500 } -->> { "id" : 2000 } on : shard2 Timestamp(4, 1) { "id" : 2000 } -->> { "id" : { "$maxKey" : 1 } } on : shard2 Timestamp(3, 3) tag aware分片策略還未測試，有待以后補(bǔ)充。
7.shards的擴(kuò)容 當(dāng)需要水平擴(kuò)容時我們就需要進(jìn)行shards添加了，添加步驟如下：（本例在70上直接添加單實(shí)例的27022端口的shard實(shí)例） directoryperdb=true shardsvr = true logpath=/home/mongod/shard4/mongod.log logappend=true fork=true port=27022 dbpath=/home/mongod/shard4 pidfilepath=/home/mongod/shard4/mongod.pid 啟動此實(shí)例后，在mongos上執(zhí)行： sh.addShard("192.168.20.70:27022") 一段時間后sh.status()看到的結(jié)果如下： test.t shard key: { "id" : 1 } unique: false balancing: true chunks: shard1 1 shard0004 1 --mongos自動將新的單實(shí)例mongoDB的chunk命名為shard0004 shard2 2 shard3 1 { "id" : { "$minKey" : 1 } } -->> { "id" : 500 } on : shard0004 Timestamp(5, 0) { "id" : 500 } -->> { "id" : 1000 } on : shard3 Timestamp(3, 0) { "id" : 1000 } -->> { "id" : 1500 } on : shard1 Timestamp(5, 1) { "id" : 1500 } -->> { "id" : 2000 } on : shard2 Timestamp(4, 1) { "id" : 2000 } -->> { "id" : { "$maxKey" : 1 } } on : shard2 Timestamp(3, 3) --可以看到balancer自動將chunk進(jìn)行了遷移，遷移機(jī)制為mongodb內(nèi)部決定，原理參見第八部分。 八、Sharding的負(fù)載均衡（即Balancer） MongoDB Sharding的自動負(fù)載均衡目前是由mongos的后臺線程來做的，并且每個集合同一時刻只能有一個遷移任務(wù)，負(fù)載均衡主要根據(jù)集合在各個 shard上chunk的數(shù)量來決定的，相差超過一定閾值（跟chunk總數(shù)量相關(guān)）就會觸發(fā)chunk遷移。 Balancer默認(rèn)是開啟的，為了避免chunk遷移影響到線上業(yè)務(wù)，可以通過設(shè)置遷移執(zhí)行窗口，比如只允許凌晨2:00-6:00期間進(jìn)行遷移。 mongo --port=27019 --連接到mongos use config db.settings.update( { _id: "balancer" }, { $set: { activeWindow : { start : "02:00", stop : "06:00" } } }, { upsert: true } ) Balancer會在服務(wù)器local time的凌晨2-6點(diǎn)才執(zhí)行chunk的balance。另外，在進(jìn)行sharding備份時（通過mongos或者單獨(dú)備份config server和所有shard），需要停止負(fù)載均衡以免備份出來的數(shù)據(jù)出現(xiàn)狀態(tài)不一致問題。 sh.setBalancerState("false") 或者： sh.stopBalancer() 九、其他問題 moveChunk歸檔設(shè)置 使用3.0及以前版本的Sharded cluster可能會遇到一個問題，停止寫入數(shù)據(jù)后，數(shù)據(jù)目錄里的磁盤空間占用還會一直增加。上述行為是由sharding.archiveMovedChunks配置項(xiàng)決定的，該配置項(xiàng)在3.0及以前的版本默認(rèn)為true，即在move chunk時，源shard會將遷移的chunk數(shù)據(jù)歸檔一份在數(shù)據(jù)目錄里，當(dāng)出現(xiàn)問題時，可用于恢復(fù)。也就是說，chunk發(fā)生遷移時，源節(jié)點(diǎn)上的空間并沒有釋放出來，而目標(biāo)節(jié)點(diǎn)又占用了新的空間。在3.2版本，該配置項(xiàng)默認(rèn)值也被設(shè)置為false，默認(rèn)不會對moveChunk的數(shù)據(jù)在源shard上歸檔。 recoverShardingState設(shè)置 使用MongoDB Sharded cluster時，還可能遇到一個問題，就是啟動 shard后，shard 不能正常服務(wù)，Primary上調(diào)用ismaster時，結(jié)果卻為 true，也無法正常執(zhí)行其他命令，其狀態(tài)類似如下： PRIMARY> db.isMaster() { "hosts" : [ "host1:9003", "host2:9003", "host3:9003" ], "setName" : "mongo-9003", "setVersion" : 9, "ismaster" : false, // primary 的 ismaster 為 false？？？ "secondary" : true, "primary" : "host1:9003", "me" : "host1:9003", "electionId" : ObjectId("57c7e62d218e9216c70aa3cf"), "maxBsonObjectSize" : 16777216, "maxMessageSizeBytes" : 48000000, "maxWriteBatchSize" : 1000, "localTime" : ISODate("2016-09-01T12:29:27.113Z"), "maxWireVersion" : 4, "minWireVersion" : 0, "ok" : 1 } 查看其錯誤日志，會發(fā)現(xiàn)shard一直無法連接上config server，上述行為是由sharding.recoverShardingState選項(xiàng)決定，默認(rèn)為true，也就是說，shard啟動時，其會連接config server進(jìn)行sharding 狀態(tài)的一些初始化，而如果config server連不上，初始化工作就一直無法完成，導(dǎo)致 shard 狀態(tài)不正常。有同學(xué)在將Sharded cluster所有節(jié)點(diǎn)都遷移到新的主機(jī)上時遇到了上述問題，因?yàn)閏onfig server的信息發(fā)生變化了，而shard啟動時還會連接之前的config server，通過在啟動命令行加上--setParameter recoverShardingState=false來啟動shard就能恢復(fù)正常了。 ? 上述默認(rèn)設(shè)計的確有些不合理，config server的異常不應(yīng)該去影響shard，而且最終的問題的表象也很不明確，在3.4大版本里，MongoDB也會對這塊進(jìn)行修改去掉這個參數(shù)，默認(rèn)不會有recoverShardingState的邏輯，具體參考SERVER-24465。

轉(zhuǎn)載于:https://www.cnblogs.com/leohahah/p/8652572.html

總結(jié)

以上是生活随笔為你收集整理的MongoDB Sharding分片配置的全部內(nèi)容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網(wǎng)站內(nèi)容還不錯，歡迎將生活随笔推薦給好友。

上一篇： 3-3 面向对象本章总结
下一篇： Fibonacci数列时间复杂度之美妙

日韩av黄I国产麻豆传媒I国产91av视频在线观看I日韩一区二区三区在线看I美女国产在线I麻豆视频国产在线观看I成人黄色短片

MongoDB Sharding分片配置

總結(jié)