HTTP Samples

These samples are using a trace from HTTP.sys - the kernel component of Microsoft's web server "Internet Information Services" (IIS):

HTTP_Samples.JPG

Here IIS was used to serve two pieces of static content:
  • helloworld.htm - a small HTML file
  • windir.txt - big text file
The following is a quick walk through of the samples:

Request Parsed

When IIS receives new http request, it traces event of type Parse (id=2). This event contains the requested Url. Each request has also an ActivityId - a special identifier that has the same value for all events belonging to the same request.

With the query 1_RequestParsed, we display jus the Url and the Activity Id:

HTTP_1_RequestParsed.JPG

Note here that if we drag-drop "Parse" from the tree, it shows with long namespace name.

To make it as simple as "Parse" we used LINQPad's feature to add namespaces
  • Press F4, "Additional Namespace imports"
  • Or select Query->QueryProperties from the menus

Http Status

When IIS parsed incoming request as above, it did not know the outcome yet. The status becomes clear from subsequent event:

HTTP_2_HttpStatus.JPG

Note here:
  • Status 304 means "not modified", which is success code for retrieving static content
  • Events have the same ActivityID-s, so they can be correlated to the start events

Whole Request

The query 3_WholeRequest.JPG correlates the start and end events:

HTTP_3_WholeRequest.JPG

Here:
  • we give short names to the begin and end streams (note the Language is now "C# statements")
  • for each begin event we want one end event, correlated on ActivityID
  • once this event arrives, we want to produce event at output and discard all the memory state we are keeping

This query is example of LINQ statement that will work on infinitely large file. The reason is that we don't keep all the events in-memory. We only keep begin events for which we have not seen the end yet.

If for example IIS handled 1000 requests per second, and requests usually finish within 10 ms, then we will keep just 10 events of type Parse.

See also the virtual time query at the end.

Aggregate Duration

In the query 4_AggregateDuration we show how Rx can be used for aggregation:

HTTP_4_AggregateDuration.JPG

Here we start from the output of the previous query, and group events by Url and the duration rounded in buckets of 0.1 milliseconds.

Here "statistics" represents a stream of aggregated buckets. By doing playback.BufferOutput(statistics) we wait till the end of the processing and maintain the result in small in-memory list. As we can see here are just 7 items.

The statement "orderby s.Milliseconds, s.Url select s" is using LINQ to Objects to sort this small data set.

Finally, clicking on the icon in the Count column title expands the histogram to the right.

Slow Requests

From the previous histogram we can tell that there are two outliers - the small file helloworld.htm was retrieved slower than the big .txt file. Let's formulate a query to find these requests:

HTTP_5_SlowRequests.JPG

This got us the answer, but at the expense of another read of the file.

Single Pass

Now we are going to use a single read, to answer both queries on one single read of the file:

HTTP_6_SinglePass.JPG

Note here:
  • doing Publish() returns IConnectableObservable. This is the Rx primitive for forking the stream.
  • doing requests.Connect() is necessary to enable the event flow.
  • playback.BufferAllOutput is safe, because we know the output is small

Here is the output:

HTTP_6_SinglePass_Output.JPG

Virtual Time

Finally, all the queries so far were about the whole file. But what if we want to do temporal query?

The following query aggregates the events in 5 sec windows as per virtual time obtained from the event timestamps:

HTTP_7_WindowInVirtualTime.JPG

The query you write in LINQPad can be simply cut-pasted in Visual Studio. See also, the description of Virtual Time in the Playback samples

Last edited Mar 4, 2013 at 10:53 PM by georgis, version 5

Comments

No comments yet.