Data synchronization in a replicated distributed database
ZHENG, XIAOMEI (2007)
Tietojenkäsittelyoppi - Computer Science
Informaatiotieteiden tiedekunta - Faculty of Information Sciences
This publication is copyrighted. You may download, display and print it for your own personal use. Commercial use is prohibited.
Date of approval
2007-05-03
The permanent address of the publication is
https://urn.fi/urn:nbn:fi:uta-1-16774
Abstract
This study analyzes data protection and disaster recovery technologies and existing solutions. The work was carried out in the Charging and Service Control business group inside Nokia Network. Our intelligent network already has high availability implemented in one data center by Oracle RAC. To achieve continuous data availability in the event of a site disaster, another data center at a remote site is introduced into our IT infrastructure. The target is to minimize the downtime associated with an outage and to prevent data loss in a site disaster, as well as to get the most out of the disaster recovery infrastructure even when there is no disaster.
Two data centers mean two locations for data. The focus of this study is to find out how data is synchronized between two databases residing at two different sites. The main goals are to understand the effort and cost of adopting a given solution, its performance, the problems that can occur and how they can be solved, and to illustrate the inherent limitations and challenges of some technologies.
The study deals with the general site disaster tolerance requirements of our customers. The problems found concern, for example, replication conflicts, the limitations of inter-site connection technologies, and the essential characteristics of synchronous and asynchronous modes. The problems are analyzed on the basis of the literature in the field of data synchronization in high availability and disaster tolerance environments.
Based on a comparison and balancing of costs, performance and availability, a cost-efficient compromise solution is proposed for our distributed database environment, in which replication conflicts are inherent. This solution is based on standby database technology that keeps a standby copy of the database at a remote site synchronized with the primary site.
Key words and terms: synchronization, replication, disaster tolerance, high availability, synchronous and asynchronous mode.