



Welcome


Contents

  • About This Guide
  • About Platform LSF
  • Learning About Platform LSF
  • Getting Technical Support


About This Guide

Last update

November 12, 2004

Latest version

www.platform.com/services/support/docs_home.asp

Purpose of this guide

This guide is the installation guide for LSF products on Windows and a supplement to Administering Platform LSF. In it, you will find the information you need to do the following:

Who should use this guide

This guide is written for Platform LSF system administrators and LSF users whose cluster includes Windows hosts.

What you should already know

This guide assumes you are familiar with:

See Administering Platform LSF for information about fundamental LSF concepts.

How to find out more

To learn more about LSF:

Typographical Conventions

Typeface: Courier
Meaning: The names of on-screen computer output, commands, files, and directories
Example: The lsid command

Typeface: Bold Courier
Meaning: What you type, exactly as shown
Example: Type cd /bin

Typeface: Italics
Meaning:
  • Book titles, new words or terms, or words to be emphasized
  • Command-line placeholders: replace with a real name or value
Example: The queue specified by queue_name

Typeface: Bold Sans Serif
Meaning:
  • Names of GUI elements that you manipulate
Example: Click OK

Command Notation

Notation: Quotes " or '
Meaning: Must be entered exactly as shown
Example: "job_ID[index_list]"

Notation: Commas ,
Meaning: Must be entered exactly as shown
Example: -C time0,time1

Notation: Ellipsis ...
Meaning: The argument before the ellipsis can be repeated. Do not enter the ellipsis.
Example: job_ID ...

Notation: lower case italics
Meaning: The argument must be replaced with a real value you provide.
Example: job_ID

Notation: OR bar |
Meaning: You must enter one of the items separated by the bar. You cannot enter more than one item. Do not enter the bar.
Example: [-h | -V]

Notation: Parentheses (  )
Meaning: Must be entered exactly as shown
Example: -X "exception_cond([params])::action" ...

Notation: Option or variable in square brackets [  ]
Meaning: The argument within the brackets is optional. Do not enter the brackets.
Example: lsid [-h]

Notation: Shell prompts
Meaning:
  • C shell: %
  • Bourne shell and Korn shell: $
  • root account: #
Unless otherwise noted, the C shell prompt is used in all command examples.
Example: % cd /bin



About Platform LSF

What LSF can do for you

Platform LSF provides workload management services across a cluster of networked computers. LSF lets you control and manage all of your networked computing resources so that you can run jobs and make the most of system resources.

Manage networked resources

By using a network of computers as a single system, you are no longer limited to the resources on your own workstation. Platform LSF automatically selects the best hosts for the job based on the current load conditions and the resource requirements of the applications.
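
As a rough illustration of load-based host selection, the following Python sketch filters hosts by a resource requirement and then picks the least-loaded one. The Host fields, the select_best_host function, and the sample values are hypothetical and are not part of LSF; LSF's real selection logic considers more factors than this.

```python
from dataclasses import dataclass


@dataclass
class Host:
    name: str
    cpu_load: float   # hypothetical load figure; lower means less busy
    free_mem_mb: int  # hypothetical free memory, in MB


def select_best_host(hosts, min_mem_mb):
    """Return the least-loaded host that meets the memory requirement, or None."""
    eligible = [h for h in hosts if h.free_mem_mb >= min_mem_mb]
    if not eligible:
        return None
    return min(eligible, key=lambda h: h.cpu_load)


hosts = [
    Host("hostA", cpu_load=0.9, free_mem_mb=4096),
    Host("hostB", cpu_load=0.2, free_mem_mb=512),
    Host("hostC", cpu_load=0.4, free_mem_mb=2048),
]

best = select_best_host(hosts, min_mem_mb=1024)
print(best.name)  # hostC: hostB is too small, hostA is busier
```

The sketch applies the resource requirement first and the load comparison second, so a lightly loaded host that cannot run the job is never chosen.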

Run jobs

Run batch jobs automatically when required resources become available, or when systems are lightly loaded. LSF maintains full control over the jobs, including the ability to suspend and resume the jobs based on load conditions.

Platform LSF can run both sequential and parallel applications as interactive and batch jobs.
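
The suspend-and-resume behavior described above can be pictured as a small state machine driven by host load. This Python sketch is only an assumption-laden illustration: the state labels, the next_state function, and both threshold values are hypothetical, not LSF configuration parameters.

```python
def next_state(state, load, suspend_above=0.8, resume_below=0.5):
    """Return the job's next state given the current host load.

    Two separate thresholds are used so that a job does not flip
    between states when the load hovers around a single value.
    """
    if state == "RUN" and load > suspend_above:
        return "SUSPENDED"
    if state == "SUSPENDED" and load < resume_below:
        return "RUN"
    return state


state = "RUN"
history = []
for load in [0.3, 0.9, 0.7, 0.4]:
    state = next_state(state, load)
    history.append(state)

print(history)  # ['RUN', 'SUSPENDED', 'SUSPENDED', 'RUN']
```

At a load of 0.7 the job stays suspended because the load has not yet dropped below the resume threshold.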

Manage applications

Transparently run software that is not available on your local host. For example, you could run a CAD tool that is only available on an HP host from your Sun workstation. The job would run on the HP host and be displayed transparently on your Sun system.

You do not need to rewrite or change your programs to take advantage of Platform LSF. Nearly any UNIX or Windows command or third-party application can be run using LSF commands.

Control access to system resources

System administrators can easily control access to resources such as:

Implement scheduling policies

LSF provides built-in scheduling policies to manage the efficient operation of the LSF Batch system. Configure your own fairshare policies at the queue or host partition level to manage conflicting demands for computing resources. Configure preemptive, preemptable, or exclusive queues to implement business policies other than standard first-come, first-served scheduling.
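
To give a feel for how a fairshare policy differs from first-come, first-served ordering, here is a deliberately simplified Python sketch. The priority formula (shares divided by one plus recent usage), the next_user function, and the user names are assumptions made for illustration; this is not the formula LSF uses.

```python
def next_user(shares, usage):
    """Pick the user with the highest ratio of assigned shares to recent usage."""
    return max(shares, key=lambda u: shares[u] / (1.0 + usage.get(u, 0.0)))


# Hypothetical share assignments and recent usage (arbitrary units).
shares = {"alice": 10, "bob": 10, "carol": 20}
usage = {"alice": 4.0, "bob": 0.0, "carol": 3.0}

print(next_user(shares, usage))  # bob: 10/1 beats carol's 20/4 and alice's 10/5
```

A user who has consumed little recently moves ahead of users who have already used much of their share, regardless of who submitted first.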

Reserve resources

Resources can be reserved by a job or configured in a queue. Reserving resources guarantees that enough resources remain available while the job is running. Because a job's resource usage changes as it runs, a host can appear to have spare capacity that a running job will need later; resource reservation helps keep that capacity from being given to other jobs.
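
The idea of reservation can be sketched as simple bookkeeping: capacity that has been promised to one job is subtracted before another job is admitted. The HostMemory class and its method names below are hypothetical and exist only for this Python illustration.

```python
class HostMemory:
    """Tracks memory on one host, counting reserved amounts as unavailable."""

    def __init__(self, total_mb):
        self.total_mb = total_mb
        self.reserved_mb = 0

    def available_mb(self):
        return self.total_mb - self.reserved_mb

    def reserve(self, amount_mb):
        """Reserve memory for a job; refuse if it would overcommit the host."""
        if amount_mb > self.available_mb():
            return False
        self.reserved_mb += amount_mb
        return True

    def release(self, amount_mb):
        """Return a job's reservation when the job finishes."""
        self.reserved_mb = max(0, self.reserved_mb - amount_mb)


host = HostMemory(total_mb=1000)
first = host.reserve(600)    # True: 600 of 1000 MB is now set aside
second = host.reserve(600)   # False: only 400 MB remains unreserved
host.release(600)
third = host.reserve(600)    # True again once the first job is done
print(first, second, third)  # True False True
```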

Share resources among hosts

Shared resources are available among a set of machines. A typical application of shared resources is to manage networked floating licenses for software applications. Any machine that has the software tool installed can request a license to run the application as a shared resource. LSF maximizes the use of network floating licenses by ensuring that a license is available when a batch job is dispatched to a host.
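
A floating license behaves like a counter shared by the whole cluster rather than by one host. The following Python sketch assumes a hypothetical LicensePool class that is consulted before a job is dispatched; the class, its methods, and the job and host names are illustrative only.

```python
class LicensePool:
    """Counts floating licenses shared by every host in the cluster."""

    def __init__(self, total):
        self.total = total
        self.in_use = {}  # job name -> host currently holding a license

    def try_dispatch(self, job, host):
        """Dispatch the job only if a license is free, whatever the host."""
        if len(self.in_use) >= self.total:
            return False
        self.in_use[job] = host
        return True

    def finish(self, job):
        """Give the license back when the job ends."""
        self.in_use.pop(job, None)


pool = LicensePool(total=2)
r1 = pool.try_dispatch("sim1", "hostA")
r2 = pool.try_dispatch("sim2", "hostB")
r3 = pool.try_dispatch("sim3", "hostA")  # no license left, job must wait
pool.finish("sim1")
r4 = pool.try_dispatch("sim3", "hostA")
print(r1, r2, r3, r4)  # True True False True
```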

LSF architecture

Platform LSF extends operating system services to create a single system image for a network of computers. LSF is built in layers, with the base system services providing dynamic load balancing and transparent access to the resources available on all machines participating in the cluster. The LSF Batch system is built on the base cluster system services and provides a central, scalable, fault-tolerant batch queuing and job scheduling system.

LIM, RES and LSLIB form the LSF base system

The LSF base system provides basic load-sharing services for the cluster. All other Platform LSF products are built on this base. It provides services such as resource information, host selection, placement advice, transparent remote execution, and remote file operation.

The LSF base system comprises the Load Information Manager (LIM), Remote Execution Server (RES), and the LSF Base API (LSLIB).

Structure of LSF Batch

LSF Batch is a distributed batch system for clusters of UNIX and Windows computers. All batch jobs go through a consistent interface, independent of the resources they need or the hosts they run on.

LSF Batch is built on top of the LSF base system, extending its services to provide batch queuing and job scheduling.

How LSF Batch runs jobs

LSF Batch accepts user jobs and holds them in queues until suitable hosts are available. Host selection is based on up-to-date load information from the master LIM, so LSF can take full advantage of all your hosts without overloading any one host.

Master Batch Daemon (mbatchd)

One master batch daemon (mbatchd) runs on the same host as the master LIM. A slave batch daemon (sbatchd) runs on each LSF server host. User jobs are held in queues by mbatchd, which checks the load information on all candidate hosts periodically. When a host with the necessary resources becomes available, mbatchd sends the job to sbatchd on that host for execution. When more than one host is available, the best host is chosen. The slave batch daemon controls the execution of the jobs and reports job status to the master batch daemon.
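
The hold-then-dispatch flow described above can be sketched in a few lines of Python. Everything here is a simplification under stated assumptions: dispatch_cycle, the free_slots dictionary, and the "most free slots wins" rule are hypothetical stand-ins, not the behavior or API of mbatchd or sbatchd.

```python
from collections import deque


def dispatch_cycle(pending, free_slots):
    """Try each pending job once; return the (job, host) placements made.

    Jobs that cannot be placed stay in the queue, in order, for a later cycle.
    """
    placements = []
    still_pending = deque()
    while pending:
        job = pending.popleft()
        candidates = [h for h, n in free_slots.items() if n > 0]
        if candidates:
            # Hypothetical "best host" rule: the host with the most free slots.
            host = max(candidates, key=lambda h: free_slots[h])
            free_slots[host] -= 1
            placements.append((job, host))
        else:
            still_pending.append(job)
    pending.extend(still_pending)
    return placements


pending = deque(["job1", "job2", "job3"])
free_slots = {"hostA": 1, "hostB": 1}

placements = dispatch_cycle(pending, free_slots)
print(placements)     # [('job1', 'hostA'), ('job2', 'hostB')]
print(list(pending))  # ['job3'] waits until a slot frees up
```

Running the cycle periodically mirrors the periodic load check in the text: a job that finds no suitable host is not rejected, only held until the next pass.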

Master Batch Scheduler Daemon (mbschd)

The master batch scheduler daemon runs on the master server host. It performs the scheduling functions of LSF and dispatches jobs to sbatchd on the execution host.

LSF Batch API library (LSBLIB)

The LSF Batch Library (LSBLIB) is the Application Programming Interface (API) for LSF Batch, providing easy access to the services of mbatchd, sbatchd, and mbschd. LSBLIB provides a powerful interface for advanced users to develop new batch processing applications in C.



Learning About Platform LSF

World Wide Web and FTP

The latest information about all supported releases of Platform LSF is available on the Platform Web site at www.platform.com. Look in the Online Support area for current README files, Release Notes, Upgrade Notices, Frequently Asked Questions (FAQs), Troubleshooting, and other helpful information.

The Platform FTP site (ftp.platform.com) also provides current README files, Release Notes, and Upgrade information for all supported releases of Platform LSF.

Visit the Platform User Forum at www.platformusers.net to discuss workload management and strategies pertaining to distributed and grid computing.

If you have problems accessing the Platform web site or the Platform FTP site, contact support@platform.com.

Platform training

Platform's Professional Services training courses can help you gain the skills necessary to effectively install, configure and manage your Platform products. Courses are available for both new and experienced users and administrators at our corporate headquarters and Platform locations worldwide.

Customized on-site course delivery is also available.

Find out more about Platform Training at www.platform.com/training, or contact Training@platform.com for details.

README files, release notes, and upgrade instructions

Before installing LSF, be sure to read the files named readme.html and release_notes.html. To upgrade to Version 6.1, follow the steps in upgrade.html.

You can also view these files from the Download area of the Platform Online Support Web page.

Platform documentation

Documentation for Platform products is available in HTML and PDF format on the Platform Web site at www.platform.com/services/support/docs_home.asp.



Getting Technical Support

Contacting Platform

Contact Platform Computing or your LSF vendor for technical support. Use one of the following to contact Platform technical support:

Email

support@platform.com

World Wide Web

www.platform.com

Mail

Platform Support
Platform Computing Corporation
3760 14th Avenue
Markham, Ontario
Canada L3R 3T7

When contacting Platform, please include the full name of your company.

See the Platform Web site at www.platform.com/contactus for other contact information.

We'd like to hear from you

If you find an error in any Platform documentation, or you have a suggestion for improving it, please let us know:

Email

doc@platform.com

Mail

Information Development
Platform Computing Corporation
3760 14th Avenue
Markham, Ontario
Canada L3R 3T7

Be sure to tell us:





Date Modified: November 12, 2004

Copyright © 1994-2004 Platform Computing Corporation. All rights reserved.