èŠ‚ç‚¹æ–‡çŒ®

å¹¶è¡Œå†…å˜æ•°æ®åº“å¿«é€Ÿäº‹åŠ¡æäº¤ä¸Žé«˜æ•ˆæ¢å¤æ–¹æ³•ç ”ç©¶

Research of Fast Commit and Highly Efficient Recovery Method for Parallel Main Memory Database

åˆ†é¡µä¸‹è½½
åˆ†ç« ä¸‹è½½
æ•´æœ¬ä¸‹è½½
åœ¨çº¿é˜…è¯»
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ å‘¨æ™“äº‘ï¼›

ã€ä½œè€…åŸºæœ¬ä¿¡æ¯ã€‘ ä¸å›½çŸ¿ä¸šå¤§å¦ ï¼Œ é€šä¿¡ä¸Žä¿¡æ¯ç³»ç»Ÿï¼Œ 2009ï¼Œ åšå£«

ã€æ‘˜è¦ã€‘ æœ¬æ–‡ç ”ç©¶é¢å‘ClusterçŽ¯å¢ƒçš„å¹¶è¡Œå†…å˜æ•°æ®åº“çš„å¿«é€Ÿäº‹åŠ¡æäº¤ä¸Žé«˜æ•ˆæ¢å¤æ–¹æ³•,ä¸»è¦åŒ…æ‹¬ä¸‰ä¸ªæ–¹é¢çš„å†…å®¹:å¿«é€Ÿäº‹åŠ¡æäº¤å’Œæ—¥å¿—å¤„ç†ã€æ£€æŸ¥ç‚¹æ“ä½œã€å¹¶è¡Œæ•°æ®åº“çš„æ¢å¤ã€‚æœ¬æ–‡æ”¹è¿›å•é˜¶æ®µæäº¤åè®®,é€šè¿‡æ—¥å¿—ä¿¡æ¯çš„å¹¶è¡Œå†™å…¥ç¡¬ç›˜,å……åˆ†åˆ©ç”¨IOå¸¦å®½,åŠ å¿«äº‹åŠ¡çš„æäº¤,é¿å…æ›´æ–°å¯†é›†åž‹åº”ç”¨ä¸æ—¥å¿—çš„å †ç§¯ã€‚ä¼ ç»Ÿä¸¤é˜¶æ®µé”åè®®å¯¼è‡´è¯»å†™äº‹åŠ¡çš„åŠ é”å†²çª,é™ä½Žç³»ç»Ÿçš„åžåé‡ã€‚æœ¬æ–‡æŠŠåŠ é”åè®®å’Œä¸´æ—¶ç‰ˆæœ¬ç®¡ç†ç»“åˆèµ·æ¥,é€šè¿‡ç‰ˆæœ¬ç®¡ç†å®žçŽ°æ— å µå¡žçš„è¯»äº‹åŠ¡,é¿å…äº†è¯»å†™äº‹åŠ¡ä¹‹é—´çš„äº’ç›¸ç‰å¾…ã€‚åŸºäºŽå¤šç‰ˆæœ¬ç®¡ç†å®žçŽ°ä¸€è‡´æ£€æŸ¥ç‚¹,å¿…é¡»ä»˜å‡ºç‰ˆæœ¬ç®¡ç†çš„ç©ºé—´å¼€é”€ä»£ä»·ã€‚æœ¬æ–‡é‡‡ç”¨å…ƒç»„çº§åˆ«çš„ç‰ˆæœ¬ç®¡ç†å’Œç‰ˆæœ¬å…±äº«æŠ€æœ¯,ç‰ˆæœ¬ç®¡ç†çš„å¼€é”€å¤§å¤§é™ä½Žã€‚åœ¨å†…å˜è¶Šæ¥è¶Šå¤§çš„æƒ…å†µä¸‹,è¿™äº›ä»£ä»·æ˜¯åˆç†çš„,å› ä¸ºç³»ç»Ÿäº‹åŠ¡å¤„ç†èƒ½åŠ›å’Œæ£€æŸ¥ç‚¹æ“ä½œæ•ˆçŽ‡å¾—åˆ°äº†è¾ƒå¤§æé«˜ã€‚æœ¬æ–‡æå‡ºåŸºäºŽæ•°æ®åˆ†åŒºçš„å¹¶è¡Œæ¢å¤ç®—æ³•,å®žçŽ°äº†æ¢å¤è¿‡ç¨‹çš„ç³»ç»Ÿå¯ç”¨æ€§ã€‚æ¢å¤è¿‡ç¨‹ä¸,å„ä¸ªç«™ç‚¹çš„æ¢å¤å·¥ä½œæ˜¯ç›¸äº’ç‹¬ç«‹çš„,åŒæ—¶åˆ©ç”¨å·®åˆ†æ—¥å¿—çš„ç‰¹ç‚¹,å®žçŽ°äº†æ•°æ®åˆ†åŒºä¹‹é—´,æ—¥å¿—ä¹‹é—´ã€æ•°æ®å’Œæ—¥å¿—ä¹‹é—´çš„å¹¶è¡Œå¤„ç†,åŠ å¿«äº†æ¢å¤è¿‡ç¨‹,å‡å°‘äº†ç«™ç‚¹æ¢å¤çš„æ€»æ—¶é—´ã€‚æœ¬æ–‡ä½¿ç”¨J-SIMè½¯ä»¶åŒ…å»ºæ¨¡è¿›è¡Œä»¿çœŸå®žéªŒ,éªŒè¯äº†æ‰€ææ–¹æ¡ˆçš„å¯è¡Œæ€§å’Œæ•ˆçŽ‡ã€‚ç»“æžœæ˜¾ç¤º:(1)ç”±äºŽä½¿ç”¨å¿«é€Ÿæäº¤æŠ€æœ¯å’Œå¹¶è¡Œæ—¥å¿—å†™å…¥,äº‹åŠ¡å“åº”æ—¶é—´ä»Ž50msé™ä½Žåˆ°21ms;(2)ä½¿ç”¨å¹¶è¡Œæ¢å¤ç®—æ³•,ç«™ç‚¹å¤±è´¥çš„æ¢å¤æ—¶é—´ä»Ž65 sé™ä½Žåˆ°28ç§’;(3)æŸ¥è¯¢äº‹åŠ¡çš„åžåé‡æ¯”æ¨¡ç³Šæ£€æŸ¥ç‚¹é«˜67%å·¦å³,è€Œæ›´æ–°äº‹åŠ¡çš„åžåé‡æ¯”æ¨¡ç³Šæ£€æŸ¥ç‚¹é«˜7.8%å·¦å³; (4)åœ¨80%æ›´æ–°äº‹åŠ¡çš„å¯†é›†åœºæ™¯ä¸,ç‰ˆæœ¬ç®¡ç†çš„ç©ºé—´å¼€é”€åœ¨11%å·¦å³ã€‚(5)å®žéªŒæµ‹è¯•çš„æ¢å¤è¿‡ç¨‹ä¸çš„4ä¸ª(1/4)æ—¶é—´æ®µ,ç³»ç»Ÿå¹³å‡åžåé‡åˆ†åˆ«ä¸º90.2Ktpsã€98.3Ktpsã€104.5Ktpsã€107.7Ktps,äº‹åŠ¡çš„å¹³å‡å“åº”æ—¶é—´åˆ†åˆ«ä¸º273msã€32.3msã€9.2msã€5.32msã€‚è¯¥è®ºæ–‡æœ‰å›¾49å¹…,è¡¨5ä¸ª,å‚è€ƒæ–‡çŒ®121ç¯‡ã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ The dissertation focuses on fast committing protocol and higly efficient recovery schems for parallel main memory database on clusters, including: fast transaction committing protocol and logging, checkpointing, and recovery of the parallel main memory database.The dissertation has enhanced traditional one phase committing protocol, and propose using parallel log writing to fully utilize the IO bandwidth to accelerate transaction committing, thus avoided log accumulation in update intensive applications. Traditional two phase locking leads to lock conficts between read only transations and update transactions, which lowers down system throughput. A novel transaction schedule protocol is proposed in this dissertation, transient versioning is combined with locking to support non blocking reading, and avoid conflicts between reader and writers.The consitent checkpointing is implemented on multi versiong, thus space overhead is necessary. Since multi versioning is done on tuple level, and version sharing is used, the overhead is reduced. At present, the capacity of main memory is getting larger and larger, the cost is reasonable, because the efficiency of transaction processing and checkpointing is improved.A partition based parallel recovery algorithm is proposed to provide system availability during recovery. During recovery, recovery of individual sites is independent, parallelism of three types, namely parallelism between partition, parallelism between log disks, and parallelism between data and log, are exploited to speedup recovery, the total recovery time is cut down.The author has used the J-SIM software package to build a simulation system and conducted a seria of experiments, the feasibility and efficiency of the scheme proposed in this dissertation are verified. Experiment results show that: (1) Transaction response time is cut down owing to parallel log writing, from 50ms to 21 ms when log disk number is 8. (2) Total recovery time of the failure site is reduced from 65s to 28s. (3) The scheme achives higher performance than fuzzy checkpointing when the system is performing checkpointing. Query throughput of the scheme improves by about 67 percent over fuzzy checkpointing, and update transaction throughput improves by about 7.8 percent over fuzzy checkpointing. (4) The space overhead is around 11 percent in update intensive senarios, the overhead is acceptable. (5) The final experiment is conducted to measure system throughput and transaction response times during recovery.during four quarters of the recovery, system throughputs are 90.2Ktpsã€98.3Ktpsã€104.5Ktpsã€107.7Ktps, and avarage transaction response time are 273msã€32.3msã€9.2msã€5.32ms respectively.Experimnets results have verified the effetiveness and efficiency of the proposed scheme.The dissertation includes 49 digrams, and 5 tables, and refers to 121 papers.æ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ Clusterï¼› å¹¶è¡Œå†…å˜æ•°æ®åº“ï¼› å¹¶è¡Œæ¢å¤ï¼› ä¸€è‡´æ£€æŸ¥ç‚¹ï¼› å…ƒç»„çº§ï¼› å¤šç‰ˆæœ¬ç®¡ç†ï¼›
ã€Key wordsã€‘ Clusterï¼› Parallel Main Memory Database Systemï¼› Parallel Recoveryï¼› Transaction Consistent Checkpointingï¼› Tuple Levelï¼› Multi Version Managementï¼›

ã€ç½‘ç»œå‡ºç‰ˆæŠ•ç¨¿äººã€‘ ä¸å›½çŸ¿ä¸šå¤§å¦

ã€åˆ†ç±»å·ã€‘TP311.13
ã€è¢«å¼•é¢‘æ¬¡ã€‘1
ã€ä¸‹è½½é¢‘æ¬¡ã€‘443
æ”»è¯»æœŸæˆæžœ

çŸ¥ç½‘èŠ‚ä¸‹è½½

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æœ¬æ–‡çš„å¼•æ–‡ç½‘ç»œ

èŠ‚ç‚¹æ–‡çŒ®

èŠ‚ç‚¹æ–‡çŒ®

å¹¶è¡Œå†…å­˜æ•°æ®åº“å¿«é€Ÿäº‹åŠ¡æäº¤ä¸Žé«˜æ•ˆæ¢å¤æ–¹æ³•ç ”ç©¶

Research of Fast Commit and Highly Efficient Recovery Method for Parallel Main Memory Database

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

å¹¶è¡Œå†…å˜æ•°æ®åº“å¿«é€Ÿäº‹åŠ¡æäº¤ä¸Žé«˜æ•ˆæ¢å¤æ–¹æ³•ç ”ç©¶