How to improve performance of non-scalar aggregations on denormalized tables

Posted by The Lazy DBA on Stack Overflow See other posts from Stack Overflow or by The Lazy DBA
Published on 2009-11-12T19:16:05Z Indexed on 2010/05/18 2:40 UTC
Read the original article Hit count: 401

Filed under:

indexing

Suppose we have a denormalized table with about 80 columns, and grows at the rate of ~10 million rows (about 5GB) per month. We currently have 3 1/2 years of data (~400M rows, ~200GB).

We create a clustered index to best suit retrieving data from the table on the following columns that serve as our primary key...

    [FileDate] ASC, 
    [Region] ASC,
    [KeyValue1] ASC, 
    [KeyValue2] ASC

... because when we query the table, we always have the entire primary key.

So these queries always result in clustered index seeks and are therefore very fast, and fragmentation is kept to a minimum. However, we do have a situation where we want to get the most recent FileDate for every Region, typically for reports, i.e.

    SELECT
     [Region]
    , MAX([FileDate]) AS [FileDate]
    FROM
     HugeTable
    GROUP BY
     [Region]

The "best" solution I can come up to this is to create a non-clustered index on Region. Although it means an additional insert on the table during loads, the hit isn't minimal (we load 4 times per day, so fewer than 100,000 additional index inserts per load). Since the table is also partitioned by FileDate, results to our query come back quickly enough (200ms or so), and that result set is cached until the next load.

However I'm guessing that someone with more data warehousing experience might have a solution that's more optimal, as this, for some reason, doesn't "feel right".

Developer IT

How to improve performance of non-scalar aggregations on denormalized tables - Developer IT

How to improve performance of non-scalar aggregations on denormalized tables

sql-server

sql-server-2005

sql

data-warehouse

indexing

Related posts about sql-server

SQL SERVER – Beginning SQL Server: One Step at a Time – SQL Server Magazine

How to install SQL Server 2005 Configuration Manager without installing SQL Server Management Studio

[MAJ] SQL Server 2005 Express Edition - La version gratuite de SQL Server 2005

How to create SQL Server Express DB from SQL Server DB

sql server 2000 error, error trying to connect to sql server 2005

Related posts about sql-server-2005

[MAJ] SQL Server 2005 Express Edition - La version gratuite de SQL Server 2005

SQLAuthority News – Feature Pack for Microsoft SQL Server 2005 SP4

SQL Server 2005 - Syncing development/production databases

Displaying the Row Number in a SELECT Query with SQL Server 2005

Displaying the Row Number in a SELECT Query with SQL Server 2005

Categories cloud