Page Object Model

ASPCompiler

Page Object Model

ASP Compiler 1.1 documentation is under development. See also the examples.

Document object models in general

Object models of certain documents are popular way to deal with them. Today many popular technologies use object models as part of their standard or related tools. Some of the most popular examples are the DHTML DOM and XML DOM. Using a structured representation of a document is much more convenient for any application than generic parsing mechanisms. This technique has one very important advantage - it specifies what a particular parser component/tool should serve to the application. Object models are used in many applications and while the models related to the world wide standards are standardized many other applications use similar techniques to represent their specific documents. Many samples are available - most of the MS Office applications, some of the Corel products, almost all the GIS applications and so on.

The role of the object model of a document is the document active state. What means the word "active" depends on the application specifics. Most often the active state is related to edit/modification operations over the document. Thus using object model is convenient way to supply flexible way to manage/modify the document and then at a certain moment to save it in a persistent form with all the changes applied.

ASP Compiler uses its component TextEmbedParser to construct object models of the processed files or their parts. If a general comparison should be made - ASPC functionality should be positioned somewhere in the middle between the editing tools and the unattended pre-processors. And the object model of the processed files is designed to match its needs. You may thinks for ASPC as for a document editing tool in which all the operations are performed sequentially without user interaction, but the user has a "proxy" interacting with the document during the file processing - the compile time scripts (CTS). Thus object model oriented structure brings flexibility and universality.

Although ASPC is oriented to HTML and ASP files it has no real need to deal with them using some of the standard object models - such as DHTML DOM. ASPC is not a browser or WYSWYG HTML editor thus it does not need to know the entire page structure. The processing made by ASPC likes a kind of a server side processing (for example ASP to VB processing), but it could benefit of some elements similar to the client side processing - some techniques similar to the DHTML client side scripting. However CTS in ASP Compiler could be used over the most textual formats and the processed file could be a HTML page but it could be even a plain text document with some custom markups. We made the logical step - a tree like representation of the file implemented with universal collections used to represent the nodes of the document. Therefore the document model (will be called Page Object Model - POM below) is a collection that contains other collections and they may contain other collections in turn. VarDictionary collection object is used for the purpose.

Some words about the VarDictionary object.

Looking at its documentation you will find out that it has the typical collection methods and properties but also serves many advanced features some convenient when dealing with tree like structures, others related to the general behavior of the object. It supports also cloning (see Clone method) - that allows a copy of the collection to be obtained with all of its sub-collections (if any). This allows a branch of a tree constructed with VarDictionaries to be cloned with one method call! Also note the FindByValue and FindByName methods which allow searching for sub-collections (tree nodes/branches) and other items through the entire structure.

Page Object Model in ASPC

ASP Compiler serves some predefined elements to the compile time scripts. Depending on the processing type they cold include ASP scripts in the page (<% %>, <%= %> and <SCRIPT RUNAT=SERVER> elements), embedded CTS (<SCRIPT RUNAT=COMPILER> elements), CUSTOM and CUSTOMFIELD elements (the predefined tags for developer defined operations), HTML comments.Also ASPC may serve specific HTML elements if they are explicitly requested by the compile time script (see OnModifyParser event).

Therefore the elements of the POM are two types - ASPC standard and CTS requested. Defining some standard ASPC elements allows cooperation between different compile time scripts (even from different vendors) and ability to request specific elements allows the model to be extended when needed.

General rules:

The HTML attributes are represented as string items in the VarDictionary object that represents the element in the POM. The name of the item is the name of the attribute and its value is the value set to the attribute in the file source. Thus this element:
<CUSTOMFIELD Attr1="Val1" Attr2=Val2>
will be represented with a collection which contains item named Attr1 with string value "Val1" and item named Attr2 with value "Val2".
When the CTS requested specific HTML tag to be included in the POM the same rule will be followed.

The ID attribute specifies the item name of the element in its parent collection. Therefore:
<HTML><BODY>
<CUSTOMFIELD ID="Field1">
</BODY></HTML>
Will appear in the POM as an item in the root collection of the document named "Field1" and the item will contain a VarDictionary object (sub-collection) representing the CUSTOMFIELD element (tag) and its attributes (if any). This rule is true for the specific elements requested by CTS too.
Therefore to distinguish attribute from an element node IsObject VBScript function can be used. The objects in the POM are always elements/nodes and non-object values are element's attributes. Note that the root of the document has no attributes - it represents the entire document thus it contains its root level elements/nodes only.

The standard ASPC defined information about the element is served as string items in the collection. In other words they appear in the same manner as the HTML attributes appear but their names are "well-known" (defined by ASPC).

Standard ASPC attributes for the ASPC defined elements (not the CTS requested elements):

Class - the type name of the element (node). The possible values are:

Text - non-parsed part of the document. Represents the parts of the page between the ASPC elements. These parts could be HTML or plain text. When the page is regenerated they appear "as is".

Script - Represent <% %> and <SCRIPT RUNAT=Server>...</SCRIPT> elements - ASP code. CTS may change them or use the ASP notation for non-ASP purposes if desired.

Assignment - Represents the <%= %> elements - ASP direct output.

ASPDirective - Represents the <% @... %> ASP elements. Supported for internal needs. Not recommended to use in CTS!

Object - Represents <OBJECT RUNAT=Server> ASP elements. Supported mostly for internal needs but CTS may use them if needed.

Custom - Represents the CUSTOM or CUSTOMFIELD element.

CompileTimeScript - Represents the <SCRIPT RUNAT=Compiler>...</SCRIPT> elements. These elements are expected to contain (or refer) script code the ASPC will execute when the POM is generated. The CTS attached to the file may access them and call functions defined in the contained script or interact with them in other ways.

IncludeBegin - A flag node. Marks the place in the POM where an  directive is used. Contains information for the included file only - the contents of the file follows as it is a part of the document. In other words include files are first resolved and if the CTS is not interested exactly how the page is constructed from the include files it may ignore the IncludeBegin and IncludeEnd nodes.

IncldeEnd - A flag node. Marks the place in the POM where the include file ended.

Comment - Represent HTML comments

Content - Contains the element's content. This is the content between the open and close tag (for the CUSTOM elements for example) or the ASP code between the <% %> or the text in the unparsed parts (Text nodes).

For the elements that follow HTML syntax the node also contains all their HTML attributes as it was mentioned above. For the ASP defined element Class and Content are reserved and should not be used as HTML attributes! Note that they are not visible elements thus they have not meaning for the WEB browsers. Therefore there is no relation between the DHTML class attribute and ASPC class attribute. Also the ASPC defined element will never appear in the output file.

CTS requested elements - see OnModifyParser event and TextEmbedParser reference. CTS requests some HTML elements to be included in the POM. The reason to request them could be information collecting task or other usage. Thus the elements requested are almost always regular HTML elements.

__class - the element (tag) name.

__container - numeric 0 - if the element has not closing tag, 1 - if it has a closing tag. Defined by the script through the AddTag method call when handling the OnModifyParser event. Note that CTS may want to treat a HTML element as an element that has no closing tag without risk - no matter if by HTML definition it has closing tag or not. However following the HTML definitions will make the CTS easier to read and understand.

To see the POM in action see the ASPC examples.