Data granularity and retention

Aggregation of data values

The Monitor Service collects various data, including user session usage, user logon performance details, session load balancing details, and connection and machine failure information. Data is aggregated differently depending on its category. Understanding the aggregation of data values presented using the OData Method APIs is critical to interpreting the data. For example:

Connected Sessions and Machine Failures occur over a period. Therefore, they are exposed as maximums over a time period.

Logon Duration is a measure of the length of time, therefore is exposed as an average over a time period.

Logon Count and Connection Failures are counts of occurrences over a period, therefore are exposed as sums over a time period.

Concurrent data evaluation

Your sessions must be overlapping to be considered concurrent. However, when the time interval is 1 minute, all sessions in that minute (whether they overlap) are considered concurrent. The size of the interval is so small that the performance overhead involved in calculating the precision is not worth the value added. If the sessions occur in the same hour, but not in the same minute, they are not considered to overlap.

Correlation of summary tables with raw data

The data model represents metrics in two different ways:

The summary tables represent aggregate views of the metrics in per minute, hour, and day time granularities.

The raw data represents individual events or current state tracked in the session, connection, application, and other objects.

When attempting to correlate data across API calls or within the data model itself, it is important to understand the following concepts and limitations:

No summary data for partial intervals. Metrics summaries are designed to meet the needs of historical trends over long periods. These metrics are aggregated into the summary table for complete intervals. There is no summary data for a partial interval at the beginning (oldest available data) of the data collection nor at the end. When viewing aggregations of a day (Interval=1440), this means that the first and most recent incomplete days have no data. Although raw data might exist for those partial intervals, it is never summarized. Pull the min and max SummaryDate from a particular summary table to determine the earliest and latest aggregate interval for a particular data granularity. The SummaryDate column represents the start of the interval. The Granularity column represents the length of the interval for the aggregate data.

Correlating by time. Metrics are aggregated into the summary table for complete intervals as described in the preceding section. They can be used for historical trends, but raw events might be more current in the state than what has been summarized for trend analysis. Any time-based comparison of summary to raw data must take into account that there is no summary data for partial intervals that might occur or for the beginning and ending of the time period.

Missed and latent events. Metrics that are aggregated into the summary table might be slightly inaccurate if events are missed or latent to the aggregation period. Although the Monitor Service attempts to maintain an accurate current state, it does not go back in time to recompute aggregation in the summary tables for missed or latent events.

Connection High Availability. During connection HA, there are gaps in the summary data counts of current connections, but the session instances are still running in the raw data.

Data retention periods. Data in the summary tables is retained on a different grooming schedule from the schedule for raw event data. Data might be missing because it has been groomed away from summary or raw tables. Retention periods might also differ for different granularities of summary data. Lower granularity data (minutes) is groomed more quickly than higher granularity data (days). If data is missing from one granularity due to grooming, it might be found in a higher granularity. Since the API calls only return the specific granularity requested, receiving no data for one granularity does not mean that the data doesn’t exist for a higher granularity for the same time period.

Time zones. Metrics are stored with UTC time stamps. Summary tables are aggregated on hourly time zone boundaries. For time zones that don’t fall on hourly boundaries, there might be some discrepancy as to where data is aggregated.

Granularity and retention

The granularity of aggregated data retrieved by Monitor is a function of the time (T) span requested. The rules are as follows:

0 < T <= 30 days use per-hour granularity

T > 31 days use per-day granularity

Requested data that does not come from aggregated data comes from the raw Session and Connection information. This data tends to grow fast, and therefore has its own grooming setting. Grooming ensures that only relevant data is kept long term. This ensures better performance while maintaining the granularity required for reporting.

#	Setting name	Schema table impacted	Tables and Chart impacted in Monitor pages	Retention days for Premium	Retention days for Advanced
1	GroomSessionsRetentionDays	MonitorData.Session and Monitordata.Connection tables	This setting impacts session details, Logon Duration by User Session, Application Based Usage tables on the Trends page.	90	31
2	GroomFailuresRetentionDays	MonitorData.MachineFailureLog and MonitorData.ConnectionFailureLog	Trends page: This setting impacts charts and tables on failures tab.	90	31
3	GroomLoadIndexesRetentionDays	MonitorData.LoadIndex	This setting impacts the data displayed in the “Load Evaluator Index” tab on the Trends page.	3	3
4	GroomDeletedRetentionDays	MonitorData.Machine, MonitorData.Catalog, MonitorData.DesktopGroup, and MonitorData.Hypervisor entities that have a LifecycleState of ‘Deleted’. This setting also deletes any related Session, SessionDetail, Summary, Failure, or LoadIndex records.	Machine, Catalog, DesktopGroup, and Hypervisor entities that have a LifecycleState of ‘Deleted’. This setting also deletes any related Session, SessionDetail, Summary, Failure, or LoadIndex records.	90	31
5	GroomSummariesRetentionDays	MonitorData.DesktopGroupSummary, MonitorData.FailureLogSummary, and MonitorData.LoadIndexSummary	This setting impacts all chart data on the Trends page.	365	31
6	GroomMachineHotfixLogRetentionDays	MonitorData.Hotfix	This setting impacts the VDA hotfix data shown on the Machine Details page.	90	31
7	GroomHourlyRetentionDays	all Summary tables	This impacts the weekly graphs shown on the Trends page.	32	31
8	GroomApplicationInstanceRetentionDays	MonitorData.ApplicationInstance	This setting impacts the chart and tables on the Capacity management tab and application usage tables on the Trends page.	90	Not applicable
9	GroomNotificationLogRetentionDays	MonitorData.NotificationLog	This setting impacts the Alerts shown on Monitor.	90	Not applicable
10	GroomResourceUsageRawDataRetentionDays	MonitorData.Resourceutilization	This setting impacts CPU and Memory charts seen in “Historical Machine Utilization” Machine Details page and the data calculation on Cost optimization area “Workload rightsizing tab”.	3	3
11	GroomResourceUsageHourDataRetentionDays	MonitorData.Resourceutilizationsummary	This setting impacts CPU and Memory charts seen in “Historical Machine Utilization” Machine Details page and the data calculation on Cost optimization area “Workload rightsizing tab”.	30	30
12	GroomResourceUsageDayDataRetentionDays	MonitorData.Resourceutilizationsummary	This setting impacts the CPU and Memory chart seen on Machine “Resource utilization” on Trends page and “Machine utilization” page for specific machine.	365	31
13	GroomProcessUsageRawDataRetentionDays	MonitorData.ProcessUtilization	This setting impacts the resource trend per process information shown on machine historical usage page.	1	1
14	GroomProcessUsageHourDataRetentionDays	MonitorData.ProcessUtilizationHourSummary	This setting impacts the CPU and Memory usage trend per process shown on the machine historical usage page.	7	7
15	GroomProcessUsageDayDataRetentionDays	MonitorData.ProcessUtilizationDaySummary	This setting impacts the CPU and Memory usage trend per process shown on the machine historical usage page.	30	30
16	GroomSessionMetricsDataRetentionDays	MonitorData.Sessionmetrics	This setting impacts all the charts seen on the “Session Performance” tab of the user details page.	1	1
17	GroomMachineMetricDataRetentionDays	MonitorData.Machinemetrics	This setting impacts the chart and table on the “Resource Utilization” tab on the Trends page.	3	3
18	GroomMachineMetricDaySummaryDataRetentionDays	MonitorData.MachineMetricDaySummary	This setting impacts the chart and table on the “Resource Utilization” tab on the Trends page.	365	31
19	GroomApplicationErrorsRetentionDays	MonitorData.ApplicationError	This setting impacts error details shown in the “Application Errors” column on the Applications page.	1	1
20	GroomApplicationFaultsRetentionDays	MonitorData.Applicationfailure	This setting impacts the “Application Faults” column on the Applications page.	1	1

Setting name

Schema table impacted

Tables and Chart impacted in Monitor pages

Retention days for Premium

Retention days for Advanced

GroomSessionsRetentionDays

MonitorData.Session and Monitordata.Connection tables

This setting impacts session details, Logon Duration by User Session, Application Based Usage tables on the Trends page.

GroomFailuresRetentionDays

MonitorData.MachineFailureLog and MonitorData.ConnectionFailureLog

Trends page: This setting impacts charts and tables on failures tab.

GroomLoadIndexesRetentionDays

MonitorData.LoadIndex

This setting impacts the data displayed in the “Load Evaluator Index” tab on the Trends page.

GroomDeletedRetentionDays

MonitorData.Machine, MonitorData.Catalog, MonitorData.DesktopGroup, and MonitorData.Hypervisor entities that have a LifecycleState of ‘Deleted’. This setting also deletes any related Session, SessionDetail, Summary, Failure, or LoadIndex records.

Machine, Catalog, DesktopGroup, and Hypervisor entities that have a LifecycleState of ‘Deleted’. This setting also deletes any related Session, SessionDetail, Summary, Failure, or LoadIndex records.

GroomSummariesRetentionDays

MonitorData.DesktopGroupSummary, MonitorData.FailureLogSummary, and MonitorData.LoadIndexSummary

This setting impacts all chart data on the Trends page.

365

GroomMachineHotfixLogRetentionDays

MonitorData.Hotfix

This setting impacts the VDA hotfix data shown on the Machine Details page.

GroomHourlyRetentionDays

all Summary tables

This impacts the weekly graphs shown on the Trends page.

GroomApplicationInstanceRetentionDays

MonitorData.ApplicationInstance

This setting impacts the chart and tables on the Capacity management tab and application usage tables on the Trends page.

Not applicable

GroomNotificationLogRetentionDays

MonitorData.NotificationLog

This setting impacts the Alerts shown on Monitor.

Not applicable

GroomResourceUsageRawDataRetentionDays

MonitorData.Resourceutilization

This setting impacts CPU and Memory charts seen in “Historical Machine Utilization” Machine Details page and the data calculation on Cost optimization area “Workload rightsizing tab”.

GroomResourceUsageHourDataRetentionDays

MonitorData.Resourceutilizationsummary

This setting impacts CPU and Memory charts seen in “Historical Machine Utilization” Machine Details page and the data calculation on Cost optimization area “Workload rightsizing tab”.

GroomResourceUsageDayDataRetentionDays

MonitorData.Resourceutilizationsummary

This setting impacts the CPU and Memory chart seen on Machine “Resource utilization” on Trends page and “Machine utilization” page for specific machine.

365

GroomProcessUsageRawDataRetentionDays

MonitorData.ProcessUtilization

This setting impacts the resource trend per process information shown on machine historical usage page.

GroomProcessUsageHourDataRetentionDays

MonitorData.ProcessUtilizationHourSummary

This setting impacts the CPU and Memory usage trend per process shown on the machine historical usage page.

GroomProcessUsageDayDataRetentionDays

MonitorData.ProcessUtilizationDaySummary

This setting impacts the CPU and Memory usage trend per process shown on the machine historical usage page.

GroomSessionMetricsDataRetentionDays

MonitorData.Sessionmetrics

This setting impacts all the charts seen on the “Session Performance” tab of the user details page.

GroomMachineMetricDataRetentionDays

MonitorData.Machinemetrics

This setting impacts the chart and table on the “Resource Utilization” tab on the Trends page.

GroomMachineMetricDaySummaryDataRetentionDays

MonitorData.MachineMetricDaySummary

This setting impacts the chart and table on the “Resource Utilization” tab on the Trends page.

365

GroomApplicationErrorsRetentionDays

MonitorData.ApplicationError

This setting impacts error details shown in the “Application Errors” column on the Applications page.

GroomApplicationFaultsRetentionDays

MonitorData.Applicationfailure

This setting impacts the “Application Faults” column on the Applications page.

Caution:

You cannot modify the values on the Monitor Service database.

Retaining data for long periods has the following implications on table sizes:

Hourly data. If hourly data is allowed to stay in the database for up to two years, a site of 1000 delivery groups can cause the database to grow as follows:

1000 delivery groups x 24 hours/day x 365 days/year x 2 years = 17,520,000 rows of data. The performance impact of such a large amount of data in the aggregation tables is significant. Given that the dashboard data is drawn from this table, the requirements on the database server might be large. Excessively large amounts of data can have a dramatic impact on performance.

Session and event data. This is the data that is collected every time a session is started and a connection/reconnection is made. For a large site (100 K users), this data grows fast. For example, two years’ worth of these tables would gather more than a TB of data, requiring a high-end enterprise-level database.

Data granularity and retention

Aggregation of data values

Concurrent data evaluation

Correlation of summary tables with raw data

Granularity and retention

In this article