Chapter 10 Configuring the Server for Performance

This chapter is intended for advanced administrators only. Be cautious when you tune your server. Do not change any values except in exceptional circumstances. Read this chapter and other relevant server documentation before making any changes. Always back up your configuration files first.

This chapter includes the following sections:


About Server Performance
Web servers have become increasingly important for both internal and external business communications. As web servers become more and more business-critical, server performance takes on added significance. The iPlanet Web Server, Enterprise Edition continues to lead in this area by setting a new standard for performance.

iPlanet Web Server was designed to meet the needs of the most demanding, high traffic sites in the world. It flexibly runs on both Unix/Linux and Windows NT and can serve both static and dynamically generated content. iPlanet Web Server can also run in SSL mode, enabling the secure transfer of information.

Because iPlanet Web Server is such a flexible tool for publishing, customer needs vary significantly. This document guides you through the process of defining your server workload and sizing a system to meet your performance needs. This document addresses miscellaneous configuration and Unix/Linux platform-specific issues. It also describes the perfdump performance utility and tuning parameters that are built into the server. The document concludes with sizing information.

Performance Issues
The first step toward sizing your server is to determine your requirements. Performance means different things to users and to webmasters. Users want fast response times (typically less than 100 ms), high availability (no "connection refused" messages), and as much interface control as possible. Webmasters and system administrators, on the other hand, want to see high connection rates, high data throughput, and uptime approaching 100%. You need to define what performance means for your particular situation.

Here are some areas to consider:

Monitoring Performance
To monitor the performance of your server, use the following tools:

These tools are explained in more detail in the following sections of this document.


The perfdump Utility
The perfdump utility is a service function built into iPlanet Web Server that collects various pieces of performance data from the web server's internal statistics and displays them in ASCII text.

To install perfdump, you need to make the following modifications in obj.conf in the netscape/server4/https-server_name/config directory:

  1. Add an object like the one in the sketch after these steps to your obj.conf file (after the default object).
  2. Edit the ppath= line if your document root is different from the one shown in the sketch. Make sure to put .perf after the path to the document root.
  3. Restart your server software, and access perfdump through the .perf URL shown in the sketch below.
You can also request the perfdump statistics and instruct the browser to refresh them automatically every n seconds by appending a refresh interval to the URL, as shown below for a 5-second refresh.
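For example, assuming a Unix installation with the document root /usr/netscape/server4/docs (a sketch: adjust the ppath value for your installation, and treat the refresh query parameter as an assumption):

<Object ppath="/usr/netscape/server4/docs/.perf">
Service fn="service-dump"
</Object>

http://server_name:port_number/.perf

http://server_name:port_number/.perf?refresh=5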

Sample Output
ns-httpd pid: 133

ListenSocket #0:

------------------

Address https://INADDR_ANY:80

ActiveThreads 48

WaitingThreads 47

BusyThreads 1

Thread limits 48/512

KeepAliveInfo:

------------------

KeepAliveCount 0/200

KeepAliveHits 0

KeepAliveFlushes 0

KeepAliveTimeout 30 seconds

CacheInfo:

------------------

enabled yes

CacheEntries 2/8192

CacheSize(bytes) 0/0

Hit Ratio 474254/474264 (100.00)

pollInterval 7200

Native pools:

------------------------

NativePool:

Idle/Peak/Limit 1/1/128

Work queue length/Peak/Limit 0/0/0

Server DNS cache disabled


Using perfdump Statistics to Tune Your Server
This section describes the information available through the perfdump utility and discusses how to tune some parameters to improve your server's performance. The default tuning parameters are appropriate for all sites except those with very high volume. The only parameter that large sites may regularly need to change is the RqThrottle parameter, which is tunable from the Server Manager.

The perfdump utility monitors these statistics:

Listen Socket Information (Listen Queue)
The listen queue size is a socket-level parameter that specifies the number of incoming connections the system will accept for that socket. The default setting is 128 incoming connections for Unix/Linux and 100 for Windows NT.

Make sure your system's listen-queue size is large enough to accommodate the size set in iPlanet Web Server. The listen queue size set from iPlanet Web Server changes the listen queue size requested by your system. If iPlanet Web Server requests a listen queue size larger than your system's maximum listen queue size, the size defaults to the system's maximum.

Warning. Setting the listen queue too high can degrade server performance. The listen queue is designed to prevent the server from becoming overloaded with connections it cannot handle. If your server is overloaded and you increase the listen queue size, the server will only fall further behind.

The first set of perfdump statistics is the listen socket (or listen queue) information. For each hardware virtual server you have enabled in your server, there is one ListenSocket structure. For most sites, only one is listed.

Note. The "thread" fields specify the current thread use counts and limits for this listen socket. Keep in mind that the idea of a "thread" does not necessarily reflect the use of a thread known to the operating system. "Thread" in these fields really means an HTTP session. If you check the operating system to see how many threads are running in the process, it is not going to be the same as the numbers reported in these fields.

Tuning
Set the listen queue size using the Listen Queue field on the Performance Tuning page (found under the Preferences tab in the Server Manager), or by setting the ListenQ parameter in magnus.conf.
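For example, a minimal magnus.conf entry (the value shown is illustrative, not a recommendation):

ListenQ 1024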

If you are using virtual servers, there are two ways to create them: Using the virtual.conf file and using the obj.conf file. If you use the virtual.conf method, the 512 default maximum threads are available to all virtual servers on an as-needed basis. If you use the obj.conf method, the 512 threads are allocated equally to each of the defined virtual servers. For example, if you had two servers, each would have 256 threads available. This is less efficient. To maximize performance in this area, use the virtual.conf method.

Address
This field contains the base address that this listen socket is listening to. For most sites that are not using hardware virtual servers, the URL is:

http://INADDR_ANY:80

The constant value "INADDR_ANY" is known internally to the server and specifies that this listen socket is listening on all IP addresses for this machine.

Tuning

This setting is not tunable except as described above.

ActiveThreads
The total number of "threads" (HTTP sessions) that are in any state for this listen socket. This is equal to WaitingThreads + BusyThreads. This setting is not tunable.

WaitingThreads
The number of "threads" (HTTP sessions) waiting for a new TCP connection for this listen socket.

Tuning

This is not directly tunable, but it is loosely equivalent to the RqThrottleMinPerSocket. See Thread limits <min/max>.

BusyThreads
The number of "threads" (HTTP sessions) actively processing requests which arrived on this listen socket.

This setting is not tunable.

Thread limits <min/max>
The minimum thread limit is a goal for how many threads the server attempts to keep in the WaitingThreads state. This number is just a goal. The number of actual threads in this state may go slightly above or below this value.

The maximum threads represents a hard limit for the maximum number of active threads that can run simultaneously, which can become a bottleneck for performance. iPlanet Web Server has default limits of 48/512. For more information, see About RqThrottle (Maximum Simultaneous Connections).

Tuning

See About RqThrottle (Maximum Simultaneous Connections).

KeepAlive Information
This section provides statistics about the server's HTTP-level KeepAlive system.

Note. The name "KeepAlive" should not be confused with TCP "KeepAlives." Also, note that the name "KeepAlive" was changed to "Persistent Connections" in HTTP/1.1, but for clarity this document continues to refer to them as "KeepAlive" connections.

Both HTTP/1.0 and HTTP/1.1 support the ability to send multiple requests across a single HTTP session. A web server can receive hundreds of new HTTP requests per second. If every request was allowed to keep the connection open indefinitely, the server could become overloaded with connections. On Unix/Linux systems, this could lead to a file table overflow very easily.

To deal with this problem, the server maintains a "Maximum number of `waiting' keepalive connections" counter. A `waiting' keepalive connection is a connection that has fully completed processing of the previous request over the connection and is now waiting for a new request to arrive on the same connection. If the server has more than the maximum waiting connections open when a new connection starts to wait for a keepalive request, the server closes the oldest connection. This algorithm keeps an upper bound on the number of open, waiting keepalive connections that the server can maintain.

iPlanet Web Server does not always honor a KeepAlive request from a client. The following conditions cause the server to close a connection even if the client has requested a KeepAlive connection:

KeepAliveCount <KeepAliveCount/KeepAliveMaxCount>
The number of sessions currently waiting for a keepalive connection and the maximum number of sessions that the server allows to wait at one time.

Tuning

Edit the MaxKeepAliveConnections parameter in the magnus.conf file.
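For example, a minimal magnus.conf entry (the value shown is illustrative):

MaxKeepAliveConnections 256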

KeepAliveHits
The number of times a request was successfully received from a connection that had been kept alive.

This setting is not tunable.

KeepAliveFlushes
The number of times the server had to close a connection because the KeepAliveCount exceeded the KeepAliveMaxCount.

This setting is not tunable.

KeepAliveTimeout
Specifies the number of seconds the server will allow a client connection to remain open with no activity. A web client may keep a connection to the server open so that multiple requests to one server can be serviced by one network connection. Since a given server can handle a finite number of open connections (limited by active threads), a high number of open connections will prevent new clients from connecting.

When SSL is enabled, KeepAliveTimeout defaults to 0, which effectively disables persistent connections. If you want to use persistent connections with SSL, set KeepAliveTimeout to a non-zero value.

Tuning

You can change KeepAliveTimeout in the Performance Tuning page in the Server Manager.
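If you prefer editing configuration files directly, the corresponding magnus.conf entry might look like the following sketch (verify the directive placement against your server's configuration reference):

KeepAliveTimeout 30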

Cache Information
This information applies to the accelerator cache, not the file cache. For an explanation of the caches, see File and Accelerator Caches.

This section describes the server's cache information. Cache entries are associated with specific static files on disk and are keyed by the file's URI. If multiple virtual servers are set up, the key also includes the virtual server's host ID and the port number.

enabled
If the cache is disabled, the rest of this section is not displayed.

Tuning

To disable the server accelerator cache, add the following line to the obj.conf file:

Init fn=cache-init disable=true

CacheEntries <CurrentCacheEntries / MaxCacheEntries>
The number of current cache entries and the maximum number of cache entries. A single cache entry represents a single URI.

Tuning

To set the maximum number of cached files in the cache, add the following line to the obj.conf file:

Init fn=cache-init MaxNumberOfCachedFiles=xxxxx

CacheSize <CurrentCacheSize / MaxCacheSize>
The CacheSize has been deprecated for this release, since the files are cached in the file cache, not the accelerator cache. For more information, see File and Accelerator Caches.

Hit Ratio <CacheHits / CacheLookups (Ratio)>
The hit ratio value tells you how efficient your site is. The hit ratio should be above 90%. If the number is 0, you need to optimize your site. See the troubleshooting section for more information on how to improve your site.

If you are logging cookies and other special information, and you are seeing a very low hit rate (close to 0), refer to Chapter 8, "Understanding Log Files," for information on relaxed logging. Relaxed logging allows you to log special information and still use the accelerator cache.

This setting is not tunable.

pollInterval
Since pollInterval is deprecated for this release, this field displays MaxAge from nsfc.conf. If you have not tuned MaxAge, it defaults to 30 seconds.

For tuning information on this setting, see MaxAge.

DNS Cache Information
Server DNS cache disabled

The DNS cache caches IP addresses and DNS names.

enabled
If the cache is disabled, the rest of this section is not displayed.

Tuning

By default, the DNS cache is off. Add the following line to obj.conf to enable the cache:

Init fn=dns-cache-init

CacheEntries <CurrentCacheEntries / MaxCacheEntries>
The number of current cache entries and the maximum number of cache entries. A single cache entry represents a single IP address or DNS name lookup.

Tuning

To set the maximum size of the DNS cache, add the following line to the obj.conf file:

Init fn=dns-cache-init cache-size=xxxxx

HitRatio <CacheHits / CacheLookups (Ratio)>
The hit ratio displays the number of cache hits and the number of cache lookups. A good hit ratio for the DNS cache is roughly 60-70%.

This setting is not tunable.

Native Thread Pools
Native pools:
------------------------
NativePool:
Idle/Peak/Limit 1/1/128
Work queue length/Peak/Limit 0/0/0

The native thread pool (NativePool) is used internally by the server to execute NSAPI functions that require a native thread for execution. Since threads on Unix/Linux are always OS-scheduled (as opposed to user-scheduled) Unix/Linux users do not need to use the native thread pool. If you want to use a thread pool on Unix/Linux, you can set one up yourself. For more information, see Additional Thread Pools.

iPlanet Web Server uses NSPR, which is an underlying portability layer that provides access to the host OS services. This layer provides abstractions for threads that are not always the same as those for the OS-provided threads. These non-native threads have lower scheduling overhead so their use improves performance. However, these threads are sensitive to blocking calls to the OS, such as I/O calls. To make it easier to write NSAPI extensions that can make use of blocking calls, the server keeps a pool of threads that safely support blocking calls (usually this means it is a native OS thread). During request processing, any NSAPI function that is not marked as being safe for execution on a non-native thread is scheduled for execution on one of the threads in the native thread pool.

If you have written your own NSAPI plug-ins such as NameTrans, Service, or PathCheck functions, these execute by default on a thread from the native thread pool. If your plug-in makes use of the NSAPI functions for I/O exclusively or does not use the NSAPI I/O functions at all, then it can execute on a non-native thread. For this to happen, the function must be loaded with a "NativeThread=no" option indicating that it does not require a native thread. To do this, add the following to the "load-modules" Init line in the obj.conf file:

Init funcs="pcheck_uri_clean_fixed_init" shlib="C:/Netscape/p186244/P186244.dll" fn="load-modules" NativeThread="no"

The NativeThread flag affects all functions in the funcs list, so if you have more than one function in a library but only some of them use native threads, use separate Init lines.
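For example, a sketch that loads the same hypothetical library twice, keeping the function that makes blocking calls on a native thread while marking the other as safe for non-native threads (the library path and function names are placeholders):

Init fn="load-modules" shlib="/plugins/myplugin.so" funcs="blocking_io_func"
Init fn="load-modules" shlib="/plugins/myplugin.so" funcs="nonblocking_func" NativeThread="no"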

Windows NT users can edit their native thread pool settings using the Server Manager.

Additional Thread Pools
You can set up additional thread pools using the Server Manager's Preferences tab. Use thread pools to put a limit on the maximum number of requests answered by a service function at any moment. For example, these additional thread pools are a way to run thread-unsafe plug-ins. By defining a pool with a maximum number of threads set to 1, only one request is allowed into the specified service function.

For more information on using the Server Manager to set up additional thread pools, see the online help.

Idle/Peak/Limit
Idle indicates the number of threads that are currently idle. Peak indicates the peak number in the pool. Limit indicates the maximum number of native threads allowed in the thread pool, and is determined by the setting of NativePoolMaxThreads. For more information, see Native Thread Pool Size.

Tuning

Modify the NativePoolMaxThreads directive in magnus.conf, or, if you are using Windows NT, use the Native Thread Pool page under the Preferences tab in the Server Manager.

Work queue length/Limit
These numbers refer to a queue of server requests that are waiting for the use of a native thread from the pool. The work queue length is the current number of requests waiting for a native thread. Limit is the maximum number of requests that can be queued at one time to wait for a native thread, and is determined by the setting of the NativePoolQueueSize directive in magnus.conf. For more information, see Native Thread Pool Size.

Tuning

Modify the NativePoolQueueSize directive in magnus.conf, or, if you are using Windows NT, use the Native Thread Pool page under the Preferences tab in the Server Manager.

Peak work queue length
This is the highest number of requests that were ever queued up simultaneously for the use of a native thread since the server was started. This value can be viewed as the maximum concurrency for requests requiring a native thread.

This setting is not tunable.

Work queue rejections
This is the cumulative number of requests that have needed the use of a native thread, but that have been rejected due to the work queue being full. By default, these requests are rejected with a "503 - Service Unavailable" response.

This setting is not tunable.

PostThreadsEarly
This advanced tuning parameter changes the thread allocation algorithm by causing the server to check for threads available for accept before executing a request. The default is Off. This setting is recommended only when the load on the server consists primarily of lengthy transactions, such as LiveWire and Netscape Application Server applications or custom applications that access databases and other complex back-end systems. Turning it on allows the server to grow its thread pool more rapidly.

Tuning

Turn this parameter on by adding this directive to magnus.conf:

PostThreadsEarly 1

Native Thread Pool Size
In previous versions of the server, you controlled the native thread pool by setting system environment variables. In iPlanet Web Server, you can use the directives in magnus.conf to control the size of the native kernel thread pool. We recommend using the magnus.conf directives; however, if you have set the system environment variables previously, they override the magnus.conf directives.

If you are using Windows NT, you can also change these settings using the Server Manager Native Thread Pool page.

Use these directives to control the native thread pool.

NativePoolStackSize. Determines the stack size of each thread in the native (kernel) thread pool.

NativePoolQueueSize. Determines the number of threads that can wait in the queue for the thread pool. If all threads in the pool are busy, then the next request-handling thread that needs to use a thread in the native pool must wait in the queue. If the queue is full, the next request-handling thread that tries to get in the queue is rejected, with the result that it returns a busy response to the client. It is then free to handle another incoming request instead of being tied up waiting in the queue.

NativePoolMaxThreads. Determines the maximum number of threads in the native (kernel) thread pool.

NativePoolMinThreads. Determines the minimum number of threads in the native (kernel) thread pool.
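For example, a magnus.conf sketch using these directives (the values are illustrative; the pool limits shown match the defaults reported by perfdump above, while the stack size is an assumption):

NativePoolStackSize 131072
NativePoolMinThreads 1
NativePoolMaxThreads 128
NativePoolQueueSize 0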

Thread Pool Environment Variables

This section describes the thread pool environment variables. Because the Native Pool is not necessary for Unix and Linux, these environment variables default to 0 for Unix and Linux.

NSCP_POOL_WORKQUEUEMAX. This value defaults to 0x7FFFFFFF (a very large number). Setting this below the RqThrottle value causes the server to execute a busy function instead of the intended NSAPI function whenever the number of requests waiting for service by pool threads exceeds this value. The default returns a "503 Service Unavailable" response and logs a message if LogVerbose is enabled. Setting this above RqThrottle causes the server to reject connections before a busy function can execute.

This value represents the maximum number of concurrent requests for service which require a native thread. If your system is unable to fulfill requests due to load, letting more requests queue up increases the latency for requests and could result in all available request threads waiting for a native thread. In general, set this value to be high enough to avoid rejecting requests under "normal" conditions, which would be the anticipated maximum number of concurrent users who would execute requests requiring a native thread.

The difference between this value and RqThrottle is the number of requests reserved for non-native thread requests (such as static HTML, gif, and jpeg files). Keeping a reserve (and rejecting requests) ensures that your server continues to fill requests for static files, which prevents it from becoming unresponsive during periods of very heavy dynamic content load. If your server consistently rejects connections, this value is set too low or your server hardware is overloaded.

NSCP_POOL_THREADMAX. This value represents the maximum number of threads in the pool. Set this value as low as possible to sustain the optimal volume of requests. A higher value allows more requests to execute concurrently, but has more overhead due to context switching, so "bigger is not always better." If you are not saturating your CPU but you are seeing requests queue up, then increase this number. Typically, you will not need to increase this number.

Busy Functions
The default busy function returns a "503 Service Unavailable" response and logs a message if LogVerbose is enabled. You may wish to modify this behavior for your application. You can specify your own busy functions for any NSAPI function in the obj.conf file by including a service function in the configuration file in this format:
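In outline, the busy parameter is added to the directive whose function you want to protect (a sketch; my-busy-function is a placeholder for a function you have loaded):

busy="my-busy-function"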

For example, you could use this sample service function:
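A hedged example, attaching a hypothetical busy function to the send-cgi Service directive:

Service fn="send-cgi" busy="service-too-busy"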

This allows different responses if the server becomes too busy in the course of processing a request that includes a number of types (such as Service, AddLog, and PathCheck). Note that your busy function will apply to all functions that require a native thread to execute when the default thread type is non-native.

To use your own busy function instead of the default busy function for the entire server, you can write an NSAPI init function that includes a func_insert call as shown below:

Busy functions are never executed on a pool thread, so you must be careful to avoid using function calls that could cause the thread to block.

Asynchronous DNS Lookup (Unix/Linux)
You can configure the server to use Domain Name System (DNS) lookups during normal operation. By default, DNS is not enabled; if you enable DNS, the server looks up the host name for a system's IP address. Although DNS lookups can be useful for server administrators when looking at logs, they can impact performance. When the server receives a request from a client, the client's IP address is included in the request. If DNS is enabled, the server must look up the hostname for the IP address for every client making a request.

Enable Asynchronous DNS to avoid Multiple Thread Serialization
DNS causes multiple threads to be serialized when you use DNS services. If you do not want serialization, enable asynchronous DNS. You can enable it only if you have also enabled DNS. Enabling asynchronous DNS can improve your system's performance if you are using DNS.

Note. If you turn off DNS lookups on your server, host name restrictions will not work, and hostnames will not appear in your log files. Instead, you'll see IP addresses.

Caching DNS Entries
You can also specify whether to cache the DNS entries. If you enable the DNS cache, the server can store hostname information after receiving it. If the server needs information about the client in the future, the information is cached and available without further querying. You can specify the size of the DNS cache and an expiration time for DNS cache entries. The DNS cache can contain 32 to 32768 entries; the default value is 1024 entries. Values for the time it takes for a cache entry to expire can range from 1 second to 1 year (specified in seconds); the default value is 1200 seconds (20 minutes).

Limit DNS Lookups to Asynchronous
It is recommended that you do not use DNS lookups in server processes because they are so resource intensive. If you must include DNS lookups, be sure to make them asynchronous. For more information on asynchronous DNS, see the Performance Tuning page in the online help.

enabled
If asynchronous DNS is disabled, the rest of this section will not be displayed.

Tuning

Add "AsyncDNS on" to magnus.conf.

NameLookups
The number of name lookups (DNS name to IP address) that have been done since the server was started.

This setting is not tunable.

AddrLookups
The number of address lookups (IP address to DNS name) that have been done since the server was started.

This setting is not tunable.

LookupsInProgress
The current number of lookups in progress.

This setting is not tunable.


Performance Buckets
Performance buckets allow you to define buckets and link them to various server functions. Every time one of these functions is invoked, the server collects statistical data and adds it to the bucket. For example, send-cgi and NSServletService are the functions used to serve CGI and Java servlet requests, respectively. You can either define two buckets to maintain separate counters for CGI and servlet requests, or create one bucket that counts requests for both types of dynamic content. The cost of collecting this information is small, and the impact on server performance is negligible. This information can later be accessed using The perfdump Utility. The following information is stored in a bucket:

The following buckets are pre-defined by the server:

Configuration
Specify all the configuration information for performance buckets in the obj.conf file. By default the feature is disabled. To enable performance measurement add the following line in obj.conf:
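A hedged sketch of the enabling line (the perf-init function name and its parameter are assumptions; check the NSAPI reference for your release):

Init fn="perf-init" disable="false"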

The following examples show how to define new buckets.

Init fn="define-perf-bucket" name="acl-bucket" description="ACL bucket"
Init fn="define-perf-bucket" name="file-bucket" description="Non-cached responses"
Init fn="define-perf-bucket" name="cgi-bucket" description="CGI Stats"

The prior example creates three buckets: acl-bucket, file-bucket, and cgi-bucket. To associate these buckets with functions, add bucket=bucket-name in front of the obj.conf function for which you wish to measure performance.
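For example, a sketch that charges CGI requests to the cgi-bucket defined above:

Service bucket="cgi-bucket" fn="send-cgi"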

Performance Report
The server statistics in buckets can be accessed using The perfdump Utility. The performance buckets information is located in the last section of the report that perfdump returns. To enable reports on performance buckets, complete the following steps:

  1. Define an extension for the performance bucket report by adding a line to the mime.types file (see the sketch after these steps).
  2. Associate the type you declared in mime.types with the service-dump function in the obj.conf file (also shown in the sketch below).
  3. Use the URL http://server_name:port_number/.perf to view the performance report.
Note. You must include a period (.) before the extension you defined in the mime.types file (in this case, .perf).
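A hedged sketch of the two lines referenced in steps 1 and 2 (the perf type name is an assumption; use whatever extension you chose):

type=perf exts=perf
Service fn="service-dump" type="perf"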

The report contains the following information:

The following is an example of the performance bucket information available through perfdump:

Performance Counters:

------------------------------------------------

Server start time: Mon Oct 11 15:37:26 1999

Average Total Percent

Total number of requests: 474851

Request processing time: 0.0010 485.3198

Cache Bucket (cache-bucket)

Number of Requests: 474254 ( 99.87%)

Number of Invocations: 474254 ( 98.03%)

Latency: 0.0001 48.7520 ( 10.05%)

Function Processing Time: 0.0003 142.7596 ( 29.42%)

Total Response Time: 0.0004 191.5116 ( 39.46%)

Default Bucket (default-bucket)

Number of Requests: 597 ( 0.13%)

Number of Invocations: 9554 ( 1.97%)

Latency: 0.0000 0.1526 ( 0.03%)

Function Processing Time: 0.0256 245.0459 ( 50.49%)

Total Response Time: 0.0257 245.1985 ( 50.52%)


File and Accelerator Caches
In iPlanet Web Server there are two caches: a front-end accelerator cache that caches response headers and contains pointers to the static file cache, and a static file cache which holds static file information as well as content. The cache-init directive initializes the accelerator cache. The file cache is turned on by default. If you want to change the default cache setup, you need to create a file called nsfc.conf. For more information, see Configuring the File Cache.

The file cache is implemented using a new file cache module, NSFC, which caches static HTML, image, and sound files. In previous versions of the server, the file cache was integrated with the accelerator cache for static pages. Therefore, an HTTP request was either serviced by the accelerator or passed to the NSAPI engine for full processing, and requests that could not be accelerated did not have the benefit of file caching. This prevented many sites that used NSAPI plug-ins, customized logs, or server-parsed HTML from taking advantage of the accelerator.

The NSFC module implements an independent file cache used in the NSAPI engine to cache static files that could not be accelerated. It is also used by the accelerator cache, replacing its previously integrated file cache. NSFC caches information that is used to speed up processing of server-parsed HTML.

Configuring the Accelerator Cache
The cache-init function controls the accelerator caching. To optimize server speed, you should have enough RAM for the server and cache because swapping can be slow. Do not allocate a cache that is greater in size than the amount of memory on the system.


Table 10.1 The cache-init parameters
disable. (Optional) Specifies whether the file cache is disabled. If set to anything but "false", the cache is disabled. By default, the cache is enabled.

MaxNumberOfCachedFiles. (Optional) Maximum number of entries in the accelerator cache. Default is 4096; minimum is 32; maximum is 32768.

MaxNumberOfOpenCachedFiles. (Optional) Maximum number of accel_file_cache entries with file_cache entries. Default is 512; minimum is 32; maximum is 32768.

CacheHashSize. (Optional) Size of the hash table for the accelerator cache. Default is 8192; minimum is 32; maximum is 32768.

Reaper. (Optional) Deletes old references to NSFC entries. The accelerator file cache contains references to entries in the static NSFC file cache; if set to "on", the reaper deletes references to NSFC entries marked for deletion. Default is "on".

ReaperInterval. (Optional) Seconds to wait before deleting old static file cache reference entries. Default is 3600 seconds. When set to 0, the reaper is disabled.

MaxFilesToReap. (Optional) Maximum number of old static file cache reference entries to delete during each round of reaping. Default is 50. When set to 0, the reaper is disabled.

NoOverflow. (Optional) IRIX only.

IsGlobal. (Optional) IRIX only.

Example
Init fn="cache-init" MaxNumberOfCachedFiles=15000 MaxNumberOfOpenCachedFiles=15000 CacheHashSize=15101 Reaper=on ReaperInterval=3600 MaxFilesToReap=50

Using the Reaper Parameters
The accelerator file cache contains references to entries in the static NSFC file cache. During the validation of the accelerator cache entry for a request, an NSFC entry may be marked for deletion if it's past its maximum age and the file has been changed. The accelerator file cache releases the old NSFC cache entry so that a new entry can be added. However, if your file cache size is not large enough, an NSFC entry may be marked for deletion to make room for a new entry. In this situation, if the entry marked for deletion has a reference in the accelerator file cache and no request for the entry ever comes again, the NSFC is not able to free the entry marked for deletion. In this situation, you can use Reaper to delete these entries.

Because this situation is relatively rare, set your ReaperInterval to an appropriately long interval, and your MaxFilesToReap to a small value.

Depending upon how the file and accelerator caches are configured and the load on the server, you may experience performance impact because of lock contention with request threads while reaping. Different platforms may respond to this contention differently. On some platforms (for example, Compaq Tru64 Unix 4.0d), under certain stressed configurations and heavy loads, this contention may cause more request threads to be created and hence more memory usage. In this case you need to disable Reaper or adjust the configuration or system resources.

Configuring the File Cache
You configure the file cache in a text file nsfc.conf. You can also turn off caching for a specific directory by using the parameter nocache in the obj.conf file.

Configuring nsfc.conf
By default, the file cache is turned on and uses the default values for all parameters described below. If you would like to change parameter values, you need to create a text file called nsfc.conf in the server_root/https-server-id/config directory. To change a parameter value for improved performance, type the parameter and its new value in the nsfc.conf file.
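For reference, a sketch of an nsfc.conf that simply restates the default values documented below (change only the parameters you need to tune):

FileCacheEnable=true
MaxAge=30
MaxFiles=1024
SmallFileSizeLimit=2048
SmallFileSpace=1000000
MediumFileSizeLimit=525000
MediumFileSpace=10000000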

CopyFiles

When CopyFiles is set to "true," the file is copied to a temporary file which the server displays when users access the file. Defaults to "false" on Unix/Linux and "true" on Windows NT. The temporary file is stored in the directory specified in TempDir.

CopyFiles=true

FileCacheEnable

Specifies whether the file cache is enabled or not.

By default, this is set to true.

FileCacheEnable=true

HashInitSize

The size of the file cache hash table. The default size is 2 * MaxFiles +1. For example, if your MaxFiles is set to 1024, the default HashInitSize is 2049.

HashInitSize=9131

HitOrder

If the MaxFiles limit has been reached when the server creates a new file cache entry, the server marks an existing entry for deletion. If HitOrder is set to "true" the file entry marked for deletion is the one that has received the fewest hits. If HitOrder is set to "false" the file entry marked for deletion is the one that has gone the longest without a hit.

HitOrder=true

MaxAge

The maximum age (in seconds) of a valid cache entry. This setting controls how long cached information will continue to be used once a file has been cached. An entry older than MaxAge is replaced by a new entry for the same file if the same file is referenced through the cache.

Set MaxAge based on whether the content is updated (existing files are modified) on a regular schedule or not. For example, if content is updated four times a day at regular intervals, MaxAge could be set to 21600 seconds (6 hours). Otherwise, consider setting MaxAge to the longest time you are willing to serve the previous version of a content file, after the file has been modified.

By default, this is set to 30.

MaxAge=30

MaxFiles

The maximum number of files that may be in the cache at once.

By default, this is set to 1024.

MaxFiles=1024

MediumFileSizeLimit (Unix/Linux)

The size (in bytes) of the largest file considered to be "medium" size; files larger than the "small" limit but no larger than this value are treated as medium. The contents of medium files are cached by mapping the file into virtual memory (currently only on Unix/Linux platforms). The contents of "large" files (larger than "medium") are not cached, although information about large files is cached.

By default, this is set to 525000 (525 KB).

MediumFileSizeLimit=525000

MediumFileSpace

The size (in bytes) of the virtual memory used to map all medium sized files.

By default, this is set to 10000000 (10MB).

MediumFileSpace=10000000

SmallFileSizeLimit (Unix/Linux)

The size (in bytes) of the largest file considered to be "small." The contents of "small" files are cached by allocating heap space and reading the file into it.

The idea of distinguishing between small files and medium files is to avoid wasting part of many pages of virtual memory when there are lots of small files. So the SmallFileSizeLimit would typically be a slightly lower value than the VM page size.

By default, this is set to 2048.

SmallFileSizeLimit=2048

SmallFileSpace

The size of heap space (in bytes) used for the cache, including heap space used to cache small files.

By default, this is set to 1MB for Unix/Linux, 0 for Windows NT.

SmallFileSpace=1000000

TempDir

TempDir sets the directory name where the temporary files are copied if CopyFiles is set to "true." Defaults to system_temp_dir/netscape/server_instance.

If you assign a temporary directory, the server creates a structure within that directory for the temporary files. For example, on Windows NT, if you set the temporary directory to C:/mytemp, the temporary files are created under C:/mytemp/c/server_doc_root. The c directory comes from the drive letter.

TempDir=C:/temp

TransmitFile

When TransmitFile is set to "true," open file descriptors are cached for files in the file cache, rather than the file contents, and PR_TransmitFile is used to send the file contents to a client. When set to "true," the distinction normally made by the file cache between small, medium, and large files no longer applies, since only the open file descriptor is being cached. By default, TransmitFile is "false" on Unix/Linux and "true" on Windows NT.

This directive is intended to be used on Unix/Linux platforms that have native OS support for PR_TransmitFile, which currently includes HPUX and AIX. It is not recommended for other Unix/Linux platforms.

TransmitFile=true

Using the nocache Parameter
You can use the parameter nocache for the Service function send-file to specify that files in a certain directory not be cached. For example, if you have a set of files that changes too rapidly for caching to be useful, you can put them in a directory and instruct the server not to cache files in that directory.

For example:
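A hedged sketch of such a configuration (the object name is a placeholder; the nocache parameter is given an empty value on the send-file Service directive):

<Object name="default">
NameTrans fn="pfx2dir" from="/myurl" dir="/export/mydir" name="myname"
</Object>
<Object name="myname">
Service method=(GET|HEAD) type=*~magnus-internal/* fn=send-file nocache=""
</Object>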

In the above example, the server does not cache static files from /export/mydir/ when requested by the URL prefix /myurl

File Cache Dynamic Control and Monitoring
An object can be added to obj.conf to enable the NSFC file cache to be dynamically monitored and controlled while the server is running. Typically this would be done by first adding a NameTrans directive to the "default" object:
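For example (a sketch; the /nsfc URI and object name follow the convention used in the rest of this section):

NameTrans fn="assign-name" from="/nsfc" name="nsfc"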

Then add a new object definition:
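A sketch of the object definition (the Service function name is an assumption based on the nsfc-dump function described below):

<Object name="nsfc">
Service fn="service-nsfc-dump"
</Object>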

This enables the file cache control and monitoring function (nsfc-dump) to be accessed via the URI, "/nsfc." By changing the "from" parameter in the NameTrans directive, a different URI can be used.

The following is an example of the information you receive when you access the URI:

You can include a query string when you access the "/nsfc" URI. The following values are recognized:

If you choose the ?list option, the file listing includes the file name, a set of flags, the current number of references to the cache entry, the size of the file, and an internal file ID value. The flags are as follows:

For sites with scheduled updates to content, consider shutting down the cache while the content is being updated, and starting it again after the update is complete. Although performance will slow down, the server operates normally when the cache is off.


Unix/Linux Platform-Specific Issues
The various Unix/Linux platforms all have limits on the number of files that can be open in a single process at one time. For busy sites, increase that number to 1024.

These Unix vendors maintain web sites with additional information about tuning their systems for web servers:

Tuning Solaris for Performance Benchmarking
The following table shows the operating system tuning for Solaris used when benchmarking for performance and scalability. These values are an example of how you might tune your system to achieve the desired result.

Table 10.2 Tuning Solaris for performance benchmarking
Parameter (scope): default value, tuned value. Comments.

rlim_fd_max (/etc/system): default 1024, tuned 8192. Process open file descriptor limit; should account for the expected load (for the associated sockets, files, and pipes, if any).
rlim_fd_cur (/etc/system): default 64, tuned 8192.
sq_max_size (/etc/system): default 2, tuned 0. Controls the streams driver queue size; setting it to 0 makes it infinite so the performance runs won't be hit by lack of buffer space. Set on clients too.
tcp_close_wait_interval (ndd /dev/tcp): default 240000, tuned 60000. Set on clients too.
tcp_time_wait_interval (ndd /dev/tcp): default 240000, tuned 60000. For Solaris 7 only. Set on clients too.
tcp_conn_req_max_q (ndd /dev/tcp): default 128, tuned 1024.
tcp_conn_req_max_q0 (ndd /dev/tcp): default 1024, tuned 4096.
tcp_ip_abort_interval (ndd /dev/tcp): default 480000, tuned 60000.
tcp_keepalive_interval (ndd /dev/tcp): default 7200000, tuned 900000. For high-traffic web sites, lower this value.
tcp_rexmit_interval_initial (ndd /dev/tcp): default 3000, tuned 3000. If retransmission is greater than 30-40%, you should increase this value.
tcp_rexmit_interval_max (ndd /dev/tcp): default 240000, tuned 10000.
tcp_rexmit_interval_min (ndd /dev/tcp): default 200, tuned 3000.
tcp_smallest_anon_port (ndd /dev/tcp): default 32768, tuned 1024. Set on clients too.
tcp_slow_start_initial (ndd /dev/tcp): default 1, tuned 2. Slightly faster transmission of small amounts of data.
tcp_xmit_hiwat (ndd /dev/tcp): default 8129, tuned 32768. To increase the transmit buffer.
tcp_recv_hiwat (ndd /dev/tcp): default 8129, tuned 32768. To increase the receive buffer.

Tuning HP-UX for Performance Benchmarking
The following table shows the operating system tuning for HP-UX used when benchmarking for performance and scalability. These values are an example of how you might tune your system to achieve the desired result.

Table 10.3 Tuning HP-UX for performance benchmarking
Parameter (scope): default value, tuned value. Comments.

maxfiles (/stand/system): default 2048, tuned 4096. Must edit the file by hand to increase beyond the 2048 limit allowed by sam.
maxfiles_lim (/stand/system): default 2048, tuned 4096. Must edit the file by hand to increase beyond the 2048 limit allowed by sam.
tcp_time_wait_interval (ndd /dev/tcp): default 60000, tuned 60000.
tcp_conn_req_max (ndd /dev/tcp): default 20, tuned 1024.
tcp_ip_abort_interval (ndd /dev/tcp): default 600000, tuned 60000.
tcp_keepalive_interval (ndd /dev/tcp): default 72000000, tuned 900000.
tcp_rexmit_interval_initial (ndd /dev/tcp): default 1500, tuned 1500.
tcp_rexmit_interval_max (ndd /dev/tcp): default 60000, tuned 60000.
tcp_rexmit_interval_min (ndd /dev/tcp): default 500, tuned 500.
tcp_xmit_hiwater_def (ndd /dev/tcp): default 32768, tuned 32768.
tcp_recv_hiwater_def (ndd /dev/tcp): default 32768, tuned 32768.

Miscellaneous magnus.conf Directives
You can use the following magnus.conf directives to configure your server to function more effectively:

Multi-process Mode
You can configure the server to handle requests using multiple processes and multiple threads in each process. This flexibility provides optimal performance for sites using threads and also provides backward compatibility to sites running legacy applications that are not ready to run in a threaded environment. Because applications on Windows NT generally already take advantage of multi-process considerations, this feature mostly applies to Unix/Linux platforms.

The advantage of multiple processes is that legacy applications which are not thread-aware or thread safe can be run more effectively in iPlanet Web Server. However, because all the Netscape/iPlanet extensions are built to support a single-process, threaded environment, they cannot run in the multi-process mode. WAI, LiveWire, Java, Server-side JavaScript, LiveConnect and the Web Publishing and Search plug-ins fail on startup if the server is in multi-process mode.

You can run your iPlanet Web Server in one of the following two modes:

In the single-process mode, the server receives requests from web clients to a single process. Inside the single server process, many threads are running which are waiting for new requests to arrive. When a request arrives, it is handled by the thread receiving the request. Because the server is multi-threaded, all extensions written to the server (NSAPI) must be thread-safe. This means that if the NSAPI extension uses a global resource (like a shared reference to a file or global variable) then the use of that resource must be synchronized so that only one thread accesses it at a time. All plug-ins provided by Netscape/iPlanet are thread-safe and thread-aware, providing good scalability and concurrency. However, your legacy applications may be single-threaded. When the server runs the application, it can only execute one at a time. This leads to severe performance problems when put under load. Unfortunately, in the single-process design, there is no real workaround.

In the multi-process design, the server spawns multiple server processes at startup. Each process contains one or more threads (depending on the configuration) which receive incoming requests. Since each process is completely independent, each one has its own copies of global variables, caches, and other resources. Using multiple processes requires more resources from your system. Also, if you try to install an application which requires shared state, it has to synchronize that state across multiple processes. NSAPI provides no helper functions for implementing cross-process synchronization.

If you are not running any NSAPI in your server, you should use the default settings: one process and many threads. If you are running an application which is not scalable in a threaded environment, you should use a few processes and many threads, for example, 4 or 8 processes and 256 or 512 threads per process.

MaxProcs (Unix/Linux)

Use this directive to set your Unix/Linux server in multi-process mode, which allows for higher scalability on multi-processor machines. If, for example, you are running on a four-processor CPU, setting MaxProcs to 4 improves performance: one process per processor.

If you are running iPlanet Web Server in multi-process mode, you cannot run LiveWire, Web Publisher, and WAI.

This directive results in one primordial process and four active processes:
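MaxProcs 4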

Note. This value is not tunable from the Server Manager. You must use magnus.conf.

Accept Thread Information
MinAcceptThreadsPerSocket / MaxAcceptThreadsPerSocket

Use these directives to specify how many threads you want in accept mode on a listen socket at any time. It's a good practice to set this to equal the number of processes. You can set this to twice (2x) the number of processes, but setting it to a number that is too great (such as ten (10x) or fifty (50x)) allows too many threads to be created and slows the server down.

Accept Timeout Information
AcceptTimeout

Use this directive to specify the number of seconds the server waits between accepting a connection to a client and receiving information from it. The default setting is 30 seconds. Under most circumstances you should not have to change this setting. By setting it to less than the default 30 seconds, you can free up threads sooner. However, you may also disconnect users with slower connections.

CGIStub Processes (Unix/Linux)
You can adjust the CGIStub parameters on Unix/Linux systems. In the iPlanet Web Server, the CGI engine creates CGIStub processes as needed to handle CGI processes. On systems that serve a large load and rely heavily on CGI-generated content, it is possible for the CGIStub processes spawned to consume all system resources. If this is happening on your server, the CGIStub processes can be tuned to restrict how many new CGIStub processes can be spawned, their timeout value, and the minimum number of CGIStub processes that will be running at any given moment.

Note. If you have an init-cgi function in the obj.conf file and you are running in multi-process mode, you must add LateInit = yes to the init-cgi line.

MinCGIStubs/MaxCGIStubs/CGIStubIdleTimeout

The three directives (and their defaults) that can be placed in the magnus.conf file to control Cgistub are:
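For example (MinCGIStubs and MaxCGIStubs are shown with the defaults described below; the CGIStubIdleTimeout value is illustrative, not a documented default):

MinCGIStubs 2
MaxCGIStubs 10
CGIStubIdleTimeout 45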

MinCGIStubs controls the number of processes that are started by default. The first CGIStub process is not started until a CGI program has been accessed. The default value is 2. Note that if you have an init-cgi directive in the obj.conf file, the minimum number of CGIStub processes are spawned at startup.

MaxCGIStubs controls the maximum number of CGIStub processes the server can spawn. This is the maximum concurrent CGIStub processes in execution, not the maximum number of pending requests. The default value shown should be adequate for most systems. Setting this too high may actually reduce throughput. The default value is 10.

CGIStubIdleTimeout causes the server to kill any CGIStub processes that have been idle for the number of seconds set by this directive. Once the number of processes is at MinCGIStubs it does not kill any more processes.

Buffer Size
SndBufSize/RcvBufSize

You can specify the size of the send buffer (SndBufSize) and the receiving buffer (RcvBufSize) at the server's sockets. For more information regarding these buffers, see your Unix/Linux documentation.

Strict HTTP Header Checking
StrictHttpHeaders

The server provides strict HTTP header checking, rejecting connections that include inappropriately duplicated headers. If you want to suppress this check, you can turn strict header checking off by setting the StrictHttpHeaders directive to off in magnus.conf:

StrictHttpHeaders off

About RqThrottle (Maximum Simultaneous Connections)
The RqThrottle parameter in the magnus.conf file specifies the maximum number of simultaneous transactions the web server can handle. The default value is 512. Changes to this value can be used to throttle the server, minimizing latencies for the transactions that are performed. The RqThrottle value acts across multiple virtual servers, but does not attempt to load-balance.

To compute the number of simultaneous requests, the server counts the number of active requests, adding one to the number when a new request arrives, subtracting one when it finishes the request. When a new request arrives, the server checks to see if it is already processing the maximum number of requests. If it has reached the limit, it defers processing new requests until the number of active requests drops below the maximum amount.

In theory, you could set the maximum simultaneous requests to 1 and still have a functional server. Setting this value to 1 would mean that the server could only handle one request at a time, but since HTTP requests for static files generally have a very short duration (response time can be as low as 5 milliseconds), processing one request at a time would still allow you to process up to 200 requests per second.

However, in actuality, Internet clients frequently connect to the server and then do not complete their requests. In these cases, the server waits 30 seconds or more for the data before timing out. (You can define this timeout period in obj.conf. It has a default of 5 minutes.) Also, some sites do heavyweight transactions that take minutes to complete. Both of these factors add to the maximum simultaneous requests that are required. If your site is processing many requests that take many seconds, you may need to increase the number of maximum simultaneous requests.

The defaults are 48/512. If your site is experiencing slowness and the ActiveThreads count remains close to the limit, consider increasing the maximum threads limit. To find out the active thread count, use The perfdump Utility.

A suitable RqThrottle value ranges from 200-2000 depending on the load. If you want your server to use all the available resources on the system (that is, you don't run other server software on the same machine), then you can increase RqThrottle to a larger value than necessary without negative consequences.

Note. If you are using older NSAPI plug-ins that are not reentrant, they will not work with the multithreading model described in this document. To continue using them, you should revise them so that they are reentrant. If this is not possible, you can configure your server to work with them by setting RqThrottle to 1 and then using a high value for MaxProcs, such as 48 or greater, but this will adversely impact your server's performance.

Tuning

There are two ways to tune the thread limit: through editing the magnus.conf file and through the Server Manager.

If you edit the magnus.conf file, RqThrottleMinPerSocket is the minimum value and RqThrottle is the maximum value.

The minimum limit is a goal for how many threads the server attempts to keep in the WaitingThreads state. This number is just a goal. The number of actual threads in this state may go slightly above or below this value. The default value is 48. The maximum threads represents a hard limit for the maximum number of active threads that can run simultaneously, which can become a bottleneck for performance. The default value is 512.
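For example, a magnus.conf sketch using these default values:

RqThrottleMinPerSocket 48
RqThrottle 512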

If you use the Server Manager, follow these steps:

  1. Go to the Preferences tab.
  2. Click the Performance Tuning link.
  3. Enter the desired value in the Maximum simultaneous requests field.
For additional information, see the online help for the Performance Tuning page.


Miscellaneous obj.conf Parameters
You can use some obj.conf function parameters to improve your server's performance. In addition to the ones listed below, see Using the nocache Parameter for information on that parameter.

For more information on using obj.conf, see the NSAPI Programmer's Guide for iPlanet Web Server.

find-pathinfo-forward
The parameter find-pathinfo-forward for the PathCheck function find-pathinfo and the NameTrans functions pfx2dir and assign-name can help you improve your performance. This parameter instructs the server to look for PATH_INFO forward in the path after ntrans-base instead of backward from the end of path in the server function find-pathinfo.

Note. The server ignores the find-pathinfo-forward parameter if the ntrans-base parameter is not set in rq->vars when the server function find-pathinfo is called. By default, ntrans-base is set.

For example:
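A hedged sketch (the prefix, directory, and object name are placeholders):

NameTrans fn="pfx2dir" find-pathinfo-forward="" from="/cgi-bin" dir="/export/home/cgi-bin" name="cgi"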

This feature can improve performance for certain URLs by doing fewer stats in the server function find-pathinfo. On Windows NT, you can also use this feature to prevent the server from changing "\" to "/" when using the PathCheck server function find-pathinfo.

nostat
You can specify the parameter nostat in the NameTrans function assign-name to prevent the server from doing a stat on a specified URL whenever possible. Use the following syntax:
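In outline (a sketch; virtual-path is the prefix you are assigning, and object-name is a placeholder):

NameTrans fn="assign-name" from="/virtual-path" nostat="/virtual-path" name="object-name"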

For example:
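A hedged example matching the /nsfc discussion below:

NameTrans fn="assign-name" from="/nsfc" nostat="/nsfc" name="nsfc"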

In the above example, the server does not stat for path /ntrans-base/nsfc and /ntrans-base/nsfc/* if ntrans-base is set. If ntrans-base is not set, the server does not stat for URLs /nsfc and /nsfc/*. By default ntrans-base is set. The example assumes the default PathCheck server functions are used.

When you use nostat=virtual-path in the assign-name NameTrans, the server assumes that a stat on the specified virtual-path will fail. Therefore, use nostat only when the path of the virtual-path does not exist on the system, for example, for NSAPI plug-in URLs. Using nostat on those URLs improves performance by avoiding unnecessary stats on those URLs.


Tuning the ACL Cache
Because of the default size of the cache (200 entries), the ACL cache can be a bottleneck, or it can simply not serve its purpose on a heavily trafficked site, where well more than 200 users can hit ACL-protected resources in less time than the lifetime of the cache entries. When this situation occurs, the iPlanet Web Server has to query the LDAP server more often to validate users, which impacts performance.

This bottleneck can be avoided by increasing the size of the ACL cache with the ACLUserCacheSize directive in magnus.conf. Note that increasing the cache size will use more resources; the larger you make the cache the more RAM you'll need to hold it.

There can also be a potential (but much harder to hit) bottleneck with the number of groups stored in a cache entry (by default four). If a user belongs to five groups and hits five ACLs that check for these different groups within the ACL cache lifetime, an additional cache entry is created to hold the additional group entry. When there are two cache entries, the entry with the original group information is ignored.

While it would be extremely unusual to hit this possible performance problem, the number of groups cached in a single ACL cache entry can be tuned with the ACLGroupCacheSize directive.

Using magnus.conf Directives
In order to adjust the cache values you will need to manually add these directives to your magnus.conf file.

ACLCacheLifetime
Set this directive to the number of seconds before cache entries expire. Each time an entry in the cache is referenced, its age is calculated and checked against ACLCacheLifetime. The entry is not used if its age is greater than or equal to ACLCacheLifetime. The default value is 120 seconds. If this value is set to 0, the cache is turned off. If you use a large number for this value, you may need to restart the iPlanet Web Server whenever you make changes to the LDAP entries. For example, if this value is set to 120 seconds, the iPlanet Web Server might be out of sync with the LDAP server for as long as two minutes. If your LDAP entries are unlikely to change often, use a large number.

ACLUserCacheSize
Set this directive to a number that determines the size of the User Cache (default is 200).

ACLGroupCacheSize
Set this directive to a number that determines how many group IDs can be cached for a single UID/cache entry (default is 4).
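
As a reference point, the following magnus.conf sketch simply spells out the three directives at their documented default values; raise them as your site requires:

  ACLCacheLifetime 120
  ACLUserCacheSize 200
  ACLGroupCacheSize 4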

Verifying Settings Using LogVerbose
If you set LogVerbose to on, you can verify that the ACL cache settings are being used. When LogVerbose is on, the server reports the ACL cache settings in the errors log at startup.
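
LogVerbose is itself a magnus.conf directive; a minimal sketch for enabling it is:

  LogVerbose on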

Warning. Do not turn on LogVerbose on a production server, because doing so degrades performance and increases the size of your error logs considerably.


Common Performance Problems
This section discusses a few common performance problems to check for on your web site:

Low-Memory Situations
If you need iPlanet Web Server to run in low-memory situations, try reducing the thread limit to a bare minimum by lowering the value of RqThrottle in your magnus.conf file. You may also want to reduce the maximum number of processes that the iPlanet Web Server spawns by lowering the MaxProcs value in the magnus.conf file.
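
As an illustration only (the specific numbers are not recommendations from this guide), a low-memory magnus.conf might reduce both values along these lines:

  RqThrottle 64
  MaxProcs 1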

Under-Throttled Server
The server does not allow the number of active threads to exceed the Thread Limit value. If the number of simultaneous requests reaches that limit, the server stops servicing new connections until the old connections are freed up. This can lead to increased response time.

In iPlanet Web Server, the server's default RqThrottle value is 512. If you want your server to accept more connections, you need to increase the RqThrottle value.

Checking
The symptom of an under-throttled server is long response times. Making a request from a browser establishes a connection to the server fairly quickly, but on an under-throttled server it may take a long time before the response comes back to the client.

The best way to tell if your server is being throttled is to look at the WaitingThreads count. If this number is getting close to 0 or is 0, then the server is not accepting new connections right now. Also check to see if the number of ActiveThreads and BusyThreads are close to their limits. If so, the server is probably limiting itself.

Tuning
See About RqThrottle (Maximum Simultaneous Connections).

Cache Not Utilized
If the cache is not utilized, your server is not performing optimally. Since most sites have lots of GIF or JPEG files (which should always be cacheable), you need to use your cache effectively.

Some sites, however, do almost everything through CGIs, SHTML, or other dynamic sources. Dynamic content is generally not cacheable and inherently yields a low cache hit rate. Don't be too alarmed if your site has a low cache hit rate; what matters most is response time. As long as your response time is good, a low cache hit rate is not a problem.

Checking
Begin by checking your Hit Ratio. This is the percentage of all hits to your server that were served from the cache. A good cache hit rate is anything above 50%. Some sites may even achieve 98% or higher.

In addition, if you are doing a lot of CGI or NSAPI calls, you may have a low cache hit rate.

Tuning
If you have custom NSAPI functions (nametrans, pathcheck, and so on), you may have a low cache hit rate. If you are writing your own NSAPI functions, see the NSAPI Programmer's Guide for iPlanet Web Server for information on making your NSAPI code cacheable as well.

KeepAlive Connections Flushed
A web site that can service 75 requests per second without keepalives may be able to do 200-300 requests per second when keepalives are enabled. Therefore, as a client requests various items from a single page, it is important that keepalives are used effectively. If the KeepAliveCount exceeds the KeepAliveMaxCount, subsequent KeepAlive connections will be closed (or "flushed") instead of being honored and kept alive.

Checking
Check the KeepAliveFlushes and KeepAliveHits values. On a site where KeepAlives are running well, the ratio of KeepAliveFlushes to KeepAliveHits is very low. If the ratio is high (greater than 1:1), your site is probably not utilizing the HTTP KeepAlives as well as it could.

Tuning
To reduce KeepAlive flushes, increase the MaxKeepAliveConnections value in the magnus.conf file. The default value is 200. By raising the value, you keep more waiting keepalive connections open.
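
For example, to allow more waiting keepalive connections than the default of 200 while staying within the open-file limits described in the warning below, you might try a value such as the following (the exact number is illustrative):

  MaxKeepAliveConnections 400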

Warning. On Unix/Linux systems, if you increase the MaxKeepAliveConnections value too high, the server can run out of open file descriptors. Typically 1024 is the limit for open files on Unix/Linux, so increasing this value above 500 is not recommended.

Log File Modes
Keeping the log files in verbose mode can have a significant effect on performance.

Only the following variables can be provided by the internal "accelerated" path: Client-Host, Full-Request, Method, Protocol, Query-String, URI, Referer, User-Agent, Authorization, and Auth-User. Because an "obscure" variable (one not in this list) cannot be provided by the accelerated path, logging one means the accelerated path is not used at all. Performance therefore decreases significantly for requests that would typically benefit from the accelerator, for example static files and images.

iPlanet Web Server has a relaxed logging mode that eases the requirements of the log subsystem. Adding "relaxed.logname=anything" to the "flex-init" line in obj.conf changes the behavior of the server in the following way: logging variables other than the "blessed few" no longer prevents the accelerated path from being used. If the accelerator is used, the "non-blessed" variables (which are then not available internally) are logged as "-". The server does not use the accelerator for dynamic content such as CGIs or SHTML, so all variables are logged correctly for those requests.
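
Assuming your access log is named "access" in flex-init, relaxed mode is enabled by adding a relaxed.access parameter to the existing line. The sketch below is illustrative only; keep your own log path and format.access string, which is typically much longer than shown here:

  Init fn="flex-init" access="logs/access" relaxed.access="anything" format.access="%Ses->client.ip% - %Req->reqpb.clf-request%"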

Using Local Variables
The JavaScript virtual machine in iPlanet Web Server implements significant improvements in processing local variables (variables declared inside a function). Therefore, you should minimize the use of global variables (variables declared between the <server> and </server> tags), and write applications to use functions as much as possible. This can improve the application performance significantly.

Improving Servlet Performance
For information on improving servlet performance, see the Programmer's Guide to Servlets in iPlanet Web Server.


Sizing Issues
This section examines subsystems of your server and makes some recommendations for optimal performance:

Processors
On Solaris and Windows NT, iPlanet Web Server transparently takes advantage of multiple CPUs. In general, the effectiveness of multiple CPUs varies with the operating system and the workload. Dynamic content performance improves the most as more processors are added to the system. Static content is mostly I/O-bound, and more primary memory means more of the content can be cached (assuming the server is tuned to take advantage of the memory), so more time is spent in I/O than in CPU-intensive work. Our study of dynamic content performance on a four-CPU machine indicates a 40-60% increase for NSAPI and about a 50-80% increase for servlets.

Memory
As a baseline, iPlanet Web Server requires 64MB RAM. If you have multiple CPUs, get at least 64MB per CPU. For example, if you have four CPUs, you should install at least 256MB RAM for optimal performance. At high numbers of peak concurrent users, also allow extra RAM for the additional threads. After the first 50 concurrent users, add an extra 512KB per peak concurrent user.
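
As a worked example of these guidelines (the user count is illustrative), a four-CPU machine expecting 300 peak concurrent users would need roughly 256 MB of RAM (4 x 64 MB) plus about 125 MB for the 250 users beyond the first 50 (250 x 512 KB), for a total of approximately 380 MB.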

Drive Space
You need to have enough drive space for your OS, document tree, and log files. In most cases 2GB total is sufficient.

Put the OS, swap/paging file, iPlanet Web Server logs, and document tree each on separate hard drives. Thus, if your log files fill up the log drive, your OS will not suffer. Also, you'll be able to tell whether, for example, the OS paging file is causing drive activity.

Your OS vendor may have specific recommendations for how much swap or paging space you should allocate. Based on our testing, iPlanet Web Server performs best with swap space equal to RAM, plus enough to map the document tree.

Networking
For an Internet site, decide how many peak concurrent users you need the server to handle, and multiply that number of users by the average request size on your site. Your average request may include multiple documents. If you're not sure, try using your home page and all its associated subframes and graphics.

Next, decide how long the average user is willing to wait for a document at peak utilization. Divide the total by that number of seconds. That is the WAN bandwidth your server needs.

For example, to support a peak of 50 users with an average document size of 24kB, and transferring each document in an average of 5 seconds, we need 240 KBytes/s - or 1920 kbit/s. So our site needs two T1 lines (each 1544 kbit/s). This allows some overhead for growth, too.

Your server's network interface card should support more than the WAN it's connected to. For example, if you have up to 3 T1 lines, you can get by with a 10BaseT interface. Up to a T3 line (45 Mbit/s) you can use 100BaseT. But if you have more than 50 Mbit/s of WAN bandwidth, consider configuring multiple 100BaseT interfaces, or look at Gigabit Ethernet technology.

For an intranet site, your network is unlikely to be a bottleneck. However, you can use the same calculations as above to decide.

 

Copyright © 2000 Sun Microsystems, Inc. Some preexisting portions Copyright © 2000 Netscape Communications Corp. All rights reserved.