Evan Reiser's Blog: How Online Indexing Works

Friday, April 27, 2007

How Online Indexing Works

The default behavior of either method of rebuilding an index is that SQL Server takes an exclusive lock on the index, so it is completely unavailable while the index is being rebuilt. If the index is clustered, the entire table is unavailable; if the index is non-clustered, there is a shared lock on the table meaning no modifications can be made but other processes can SELECT from the table (But obviously they cannot take advantage of the index being rebuilt). Now this is pretty miserable in large databases since queries wont be able to take advantage of indexes resulting in our arch nemisis: table scans.

The online build works by maintaining two copies of the index simultaneously, the original (source) and the new one (target). The target is used only for writing any changes made while the rebuild is going on. All reading is done from source as well. SQL Server row-level versioning is used so anyone retrieving information from the index will be able to read consistent data.

Here are the steps involved in rebuilding a non-clustered index

A shared lock is taken on the index, which prevents any data modification queries and an Intent-Shared lock is taken on the table
The index is created with the same structures as the original and marked as write-only
The shared lock is released on the index, leaving only the Intent-Shared lock on the table.
A versioned scan is started on the original index, which means modifications made during the scan will be ignored. The scanned data is copied to the target
All subsequent modifications will write to both the source and the target. Reads will use only the source
The scan of the source and copy to the target continues while normal operations are performed.
The scan completes
A Schema-Modification-Lock (most strict lock) is taken to make the source completely unavailable
The source is dropped, metadata is updated, and the target is made to be read-write
The Schema-Modification-Lock is released.

A clustered index rebuild works exactly like a non-clustered rebuild property as long as there is no schema change (a change of index keys or uniqueness property).

For a build of a new clustered index or a rebuild of a clustered index with a schema change there are a few more differences. First, an intermediate mapping index is used to translate between the source and target physical structures. Additionally, all existing non-clustered indexes are rebuilt one at a time after a new base table ahs been built. Creating a clustered index on a heap with two non-clustered indexes involves the following steps:

Create a new write-only clustered Index
Create a new non-clustered index based on the new clustered index
Create another new non-clustered index based on the new clustered index
Drop the heap and the two original non-clustered indexes

Online Index rebuilding can be costly as the server must maintain up to 6 structures at the same time, however this is incredibly useful for removing fragmentation or re-establishing a fillfactor when the data must be available 24/7 in high availability systems.

No comments:

Post a Comment

Disclaimer

Evan Reiser's Blog on Technology, Database Systems, Computer Systems, AJAX, ASP.NET, SQL Server Financial Advice Disclaimer: I, Evan Reiser provide general information, not individually targeted personalised advice. Advice from this site does not take into account any investor’s particular investment objectives, financial situation and personal needs. Investors should assess for themselves whether the advice is appropriate to their individual investment objectives, financial situation and particular needs before making any investment decision on the basis of such general advice. Investors can make their own assessment of the advice or seek the assistance of a professional adviser. Investing entails some degree of risk. Investors should inform themselves of the risks involved before engaging in any investment. I, Evan Reiser, endeavor to ensure accuracy and reliability of the information provided but does not accept any liability whatsoever, whether in tort or contract or otherwise, for any loss or damage arising from the use of this site's data and systems. Past performance is not necessarily indicative of future results. Information and advice provided here is not an offer to buy or sell securities. Before commencing an investment program I recommend you seek independent professional legal, tax and investment advice as to whether it is suitable for your particular needs and circumstances. Failure to seek detailed professional personally tailored advice prior to acting could lead to you acting contrary to your own best interests and could lead to losses of capital. I, Evan Reiser, expressly deny any liability to you for loss in any manner or form now or at any time in the future. You should be aware that some investments will lose money. Conscious investment selections are on the basis of probabilities - that they are proven profitable at some point in time in the future more often than not. Any action based on this information should observe standard investment and trading rules such as diversification, stop losses and matching to personal risk tolerances. Investing strategies and actions discussed in our publications may not be suitable for you. You must make your own investment decisions in light of your own circumstances.

Evan Reiser's Blog

Friday, April 27, 2007

How Online Indexing Works

No comments:

Deal of the Day

About Me

Evan's Links

Popular Posts

Contact

Blog Archive

Relevant Links

Disclaimer