PowerPivot – Microsoft’s Answer to Self-Service Reporting COLLABORATIVE WHITEPAPER SERIES

PowerPivot – Microsoft’s Answer
to Self-Service Reporting
Microsoft’s Latest Foray in the Business Intelligence Arena
COLLABORATIVE WHITEPAPER SERIES
COLLABORATIVE WHITE PAPER SERIES:
PowerPivot – Microsoft’s Answer to Self-Service Reporting
In the last quarter of 2010, Microsoft first introduced PowerPivot into the Business Intelligence (BI)
market with the release of SQL Server 2008 R2. PowerPivot is the latest addition to the Microsoft BI
stack and is being touted by Microsoft as the answer to self-service BI.
So what exactly is PowerPivot? PowerPivot was initially, at the time of its launch, a free, downloadable
add-in (available from Microsoft) to Excel 2010. Today, with the latest version of Excel, it is still an
add-in but is now integrated in to the core product of both the professional plus version of Excel 2013
and the Office 365 professional plus version. One of the key features of PowerPivot is that it enables
an Excel user to bring in millions of rows of data from a variety of different data sources, join them
together and then create basic reports and charts in minutes. Virtually everyone has hit limits loading
data into Excel. With PowerPivot for Excel, a user can now load over 100 million rows of data very
quickly and get back aggregated results such as “total revenue,” on the fly, without the need for any
pre-calculated results.
To liken this to other capabilities in the BI marketplace, PowerPivot is an in-memory analytics tool like
QlikView. This is PivotTables on Steroids!
I. PowerPivot – What it is and what it isn’t
What it is
As stated above, PowerPivot is an add-in for Excel. Once installed (with Excel 2010) or
activated (with Excel 2013), PowerPivot appears as a menu option on the main Excel
ribbon. Clicking this option reveals the main PowerPivot menu as Figure 1 reveals.
PowerPivot enables a user to load and join data from a wide variety of sources –
Relational databases - such as SQL Server, DB2 or Oracle, Analysis Services cubes,
Reporting Services files, text files, atom data feeds, or any OLEDB/ODBC data source.
One of the key benefits with PowerPivot is that users can combine any of the above
data sources with data contained in an Excel Table. Each data source (including an
Excel Table) loads into a separate tab. Data can be sorted and filtered on the way
into PowerPivot without writing any “code” or SQL – all through a GUI interface!
It is important to note that using this GUI approach values that do not exist in the data
cannot be filtered out. The data brought in can be augmented by creating new Derived
Analysis Expressions (DAX).
2
Figure 1: PowerPivot in Excel
Most data warehouses or data marts do not contain all the
data sources that are needed across the organization and
so PowerPivot allows you to easily combine and link your
own Excel tables with these other sources—a feature that
is extremely important to virtually all business analysts.
With PowerPivot you can load and filter data on the way
into Excel without having to write a lick of SQL, as well as
filter data once it’s in. Different data sets can only (in this
version) be joined via a single column. If multiple columns
are needed to perform a join, a compound key could
be created within PowerPivot and then this key used for
the join. This is illustrated in Figure 2, which shows the
popup menu used to create a relationship, known as a
“join” in PowerPivot terminology.
of space needed to save an Excel file once it’s been loaded
with data. As the underlying engine is Analysis Services,
PowerPivot offers a vast array of date and time intelligence
functions and capabilities. With PowerPivot a user has
access to functions such as TOTALQTD, TOTALYTD,
SAMEPERIODLASTYEAR, NEXTMONTH, NEXTYEAR
and ENDOFMONTH.
PowerPivot also has special functions to deal with measures
that do not aggregate across time such as Inventory.
What can be done with the data once it’s in Excel? As stated
briefly above, new calculated columns of data can be added
in PowerPivot by using DAX. At first glance, DAX functions
look very similar to existing Excel functions, but the major
difference is that functions in Excel normally operate on
cells or ranges, whereas DAX functions operate on whole
columns, tables and measures, as well as relationally across
tables. There are numerous DAX functions that span the
following categories: filter, logical, text, aggregate, and date
and time intelligence. By default, the outcome of a DAX
formula would be based on the filter’s (Slicer) settings. For
example, if ‘Year’ was a slicer and ‘2008’ was selected and
you had two columns – ‘Revenue’ and ‘% of Total’ with
Products for Rows, the results of the table would show
product ‘Revenue’ and ‘% of Total for 2008’ for all products.
This can be overridden in cases where you may want to
show the ‘% of Total’ for all years and not just the Year or
Years selected. A few brief examples of DAX functions are:
Under the hood, PowerPivot creates a virtual Analysis
Services cube and uses Vertipaq (a compression technology
that Microsoft acquired) to drastically reduce the amount
Figure 2: PowerPivot “join”
3
COLLABORATIVE WHITE PAPER SERIES:
PowerPivot – Microsoft’s Answer to Self-Service Reporting
•SUM (Orders [OrderID]) - In this example, ‘Orders’
is the tab name (based on a table called orders) and
‘[OrderID]’ is the column name
organization, you need to use SharePoint 2010 or later (this
uses Excel Services and PowerPivot Galleries and Libraries
so that non PowerPivot users can view and interact with
the analysis without having PowerPivot on their desktops).
It should be noted that if a PowerPivot model is created
using Excel 2013, it cannot be shared to someone using
an earlier (Excel 2010) version.
•CALCULATE (SUM (‘Order Details’ [Sales Amount]),
ALL(Orders))
•RIGHT (Orders [OrderID], 2)
•RELATED (Products [Category])
Most organizations – even if they are Microsoft centric –
are normally slow to adopt the latest release (2013) of
Office and so will make the roll out and adoption of this
technology (other than on isolated PC’s) slower in the
market. Another important distinction is whether a user
has a 32 or 64 bit operating system (OS). Having a 64
bit OS allows a user to load in more rows and have faster
aggregations. In tests on a “normal” laptop using the 32
bit PowerPivot version (in Excel 2010), 3.7 million rows of
data were loaded in approximately five minutes. PowerPivot
had no problems with all totals appearing instantaneously.
After data has been brought in, all the necessary columns
added, and the joins made, there are several ways to
report on data:
•The main two ways are to create either PivotTables or
PivotCharts (an example of the type of reporting that
can be created is given at the end of this document).
•Write an SQL Server Reporting Services report on a
PowerPivot model published to SharePoint.
•Use the GETPIVOTDATA() function from Excel.
An example of this type of function would be
=GETPIVOTDATA(“[Measures].[Sum of Quantity]”,
$A$1, “[Products].[Category]”,“[Products].
[Category].&[Beverages]”).
Figure 3: Value fields
•Use Excel’s CUBE functions. An example of this type
of function would be =CUBEMEMBER(“PowerPivot
Data,” “[Measures].[Sales],” “Sales”).
•Use a third party tool such as Tableau.
Fortunately for the user, both the GETPIVOTDATA and
CUBEMEMBER functions are self-generating once a user
types the “=” sign in a cell.
Saving a PowerPivot analysis is just like saving an Excel file
and on the face of it seems like just an ordinary Excel file,
but by changing the extension to be .zip instead of .xlsx, the
difference becomes clear. If you open this file with a viewer
like WinZip, you will see that in fact there are several
folders, and by exploring one of them – ‘xl,’ ‘CustomData’ –
you will see the size of the compressed data file.
The term “free” was used earlier, but there is a hidden
cost to PowerPivot. First, you either need to upgrade to
Excel 2010 or Excel 2013, and secondly, in order to share
any information created using PowerPivot around the
4
Even outside of PowerPivot, Excel 2010 and later has
some interesting PivotTable features such as slicers (filters)
that allow a user to easily filter data. Slicers can be used
when creating any PowerPivot Analysis to enable a user to
very easily filter the data that they need. Another PivotTable
feature that is extremely useful, and again worthy of note,
enables the user to select a value field such as “Revenue,”
place this in the data area, and then drag the field again
into the data area and convert it to a variety of formats
such as ‘% of Total’ or “Rank.” This feature can create
a multitude of typical calculations used in reporting in a
few simple mouse clicks, whereas in other BI tools this
involves creation of formulas and utilizing various functions
within the various tools—no easy task.
semantic layer that can be shared across the organization
and security. A semantic layer is one element that many BI
tools provide (for example in BusinessObjects, this layer is
called a Universe, and in Cognos, it is called Framework
Manager Model) that shields a user from the joins and
underlying table structures. Another gap lies within the
security capabilities of PowerPivot. The only security
offered is by either SharePoint or the user account for the
underlying database access. Since its launch, Microsoft has
partially addressed the security issue and the semantic layer
issue by allowing a PowerPivot model to be deployed to a
Tabular Analysis Services model on a server. This model can
then have:
•Role-based security model using Active Directory
This feature is illustrated in Figure 4. In this example,
‘Internet Sales Amount’ was dragged into the data area
twice. By right clicking and selecting ‘Show Values As,’
a sub menu showing all the different options appears.
•Row and column level security to secure data at the
deepest level
Without SharePoint 2010 and beyond, the sharing of
analyses created using PowerPivot are going to be limited
to those who have PowerPivot installed on their local
machines. Another area worth mentioning, where caution
should be taken, is the use of date and time intelligence
functions. Some functions will only return correct or intuitive
results if the dates are contiguous. To get around this issue,
the creation of a separate date or time dimension with
continuous dates is recommended.
By selecting ‘% of Column Total’ the values appear as
follows, automatically formatted.
This new column can easily be renamed to a more
appropriate name.
What it isn’t
What are the gaps or cons of using PowerPivot? One of the
most important elements missing from this tool is a robust
Figure 4: Selecting value fields
II. Summary
Most business people conduct reporting in Excel – and it is
especially important to those in the finance function. When
Collaborative Consulting’s professionals introduce a new BI
tool to a client, one of the most frequently asked questions
is “can I export that to Excel?” The main reasons this may
not yet be possible are:
•The organization may not have a data warehouse,
and therefore the data needed for reporting is
contained in numerous places.
5
COLLABORATIVE WHITE PAPER SERIES:
PowerPivot – Microsoft’s Answer to Self-Service Reporting
•The organization has a data warehouse, but the
warehouse does not contain all sources necessary
for all reporting.
warehouse. Warehouses typically automate, standardize
and cleanse data as well as capture certain attributes
so that changes over time can be captured and reported
correctly, for example capturing sales force changes and
organizational hierarchies. These types of things are much
harder to do in PowerPivot. PowerPivot empowers a user
with the ability to combine this with other local data to
complete reporting without concern for size or row limits.
•The reporting tools typically offered by BI vendors
allow for simple report generation, but typically,
more complex reports are needed by the business
and therefore require more of an IT developer’s
skill set – where things like year over year percent
variance calculations are required.
Most BI vendors, and especially those in the in-memory
analytics category such as QlikView, will view Microsoft
as a threat. The only thing that will slow down adoption
in the market is the fact that Office and SharePoint 2010
(or later) will be necessary in order for users to be able
to share analysis (although it should be noted that only
Office 2010 or later is needed for standalone reporting).
The latest version of Excel 2013 with all the new features
such as Quick Analysis, Power Maps (allowing excellent
geospatial analysis), Power Query and built in office “apps”
to make everyday tasks in Excel even easier, make Excel a
formidable self-service tool.
For these reasons, many business analysts use tools
like Excel and Access to load, manipulate and report on
data from multiple sources. Historically, Access has been
chosen over Excel because of initial row limits or memory
limits with Excel. Access also has a limit on the size of
the database it can handle. With PowerPivot, a user has
the ability to load massive amounts of data – over 100
million rows – into Excel and do their reporting, thus
enabling business users to do their jobs with greater ease
and self-sufficiency.
Even though some entire data warehouses could now be
loaded into Excel, there will always be a need for a data
Figure 5: An example of the reporting that can be created in Excel. All of Excel’s features can be applied and numerous chart types and templates are available.
6
COLLABORATIVE WHITE PAPER SERIES:
PowerPivot – Microsoft’s Answer to Self-Service Reporting
Collaborative Consulting is a leading information
technology services firm dedicated to helping
our clients achieve business advantage through
the use of strategy and technology. We deliver a
comprehensive set of solutions across multiple
industries, with a focus on business process and
program management, information management,
software solutions, and software performance and
quality. We also have a set of offerings specific to
the life sciences and financial services industries.
Our unique model offers both onsite management
and IT consulting as well as U.S.-based remote
solution delivery.
To learn more about Collaborative, please
visit our website at www.collaborative.com,
email us at [email protected], or
contact us at 877-376-9900.
Copyright © 2014 Collaborative Consulting, LLC. All rights reserved. This product
is protected by U.S. and international copyright and intellectual property laws.
WP.614.24