Zibb
Subscribe to Control Engineering
FirstLight
Email
Print
Reprint
Learn RSS

XML: The future of data management

Laura Zurawski -- Control Engineering, 7/1/2000

XML (eXtensible Markup Language) is derived from SGML (Standard Generalized Markup Language). SGML was created as an international standard (ISO 8879) in 1986 and provides a standard format for embedding descriptive markup into documents. Descriptive markup refers to pieces of code (often called "tags") that define the presentation of a document.

XML takes SGML a little further by redefining some internal parameters to make the language suited for use on the World Wide Web. What makes XML different from other web languages such as HTML (Hypertext Markup Language, another derivative of SGML) is that markup tags are not pre-defined.

How XML works

HTML is fine for simply displaying documents. However, if there is a need to actually make use of the information contained in these documents, HTML can't do much to help. This is where XML can come in handy. User-defined tags created in XML can be processed by other web applications and used in much more dynamic ways than was previously possible with HTML.

For example, consider the HTML in fig. 1:

What is returned looks like a bulleted list. This is fine if you want to simply display the information, but what if you want to use it for something else? Perhaps you want to pull the information into another web page somewhere else on the server, in a different format. Then you would need XML, which you could use to produce a file like fig. 2:

This looks different than the HTML, and it works in a different way. Tags in the HTML file describe how information contained in the tags looks. Tags in the XML file describe what the information is.

Associated with the XML file is a document type definition (DTD), which describes what to do with user-defined tags. A DTD for the above XML example might look like fig. 3:

This DTD gives the parameters of the tags. The element "mailinglist" may or may not contain an intro, and must contain a name, address, and city, and country. The other elements, defined as "#PCDATA," are all plain text, defined by what the user types in.

The DTD can be used to extract the information in the XML file and place it somewhere else, such as a program that creates mailing labels.

Valid, well-formed

An XML program that has an associated DTD, and conforms to it, is called valid. However, one of the ways in which XML differs from SGML is that is does not necessarily need to be valid to function. An XML file can be created independently of a DTD, the caveat being that the tags defined in the file are limited only to that one file. By using a DTD, the user can create multiple files with the same tags.

Whether or not the file uses a DTD, it must be well-formed. This means that it must adhere to certain guidelines to ensure that the file is parsed correctly. Some of these guidelines include:

  • All elements must have a start and end tag;
  • All "empty" HTML tags (tags that do not require an end tag in HTML) must end with "/>;"
  • If no DTD is used, the file must be defined as "standalone;"
  • All element attributed must be in quotes; and
  • Tags must be nested in proper order.
What's ahead

XML has been in existence for only a short time, but there are already initiatives underway to improve it. XML enthusiasts are developing schemas that go beyond the capabilities of regular DTDs. These schemas may eventually lead to industry standards.

One such development is Microsoft's (Redmond, Wa.) BizTalk initiative, which aims to provide an open design framework for implementing XML schemas throughout business and industry. Sequencia (Phoenix, Ariz.) is one of the first companies to develop BizTalk schemas for the process industries. XML is the driving force behind its latest Internet endeavor, ProcessPoint.com, a business-to-business portal based on published BizTalk schemas to enable interaction between industry professionals.

For more information about XML and BizTalk, go to www.controleng.com.


Author Information
Laura Zurawski, web editor lzurawski@cahners.com

 

Fig. 1

<font face="arial" size="4">

<b><i>This is some information:</i></b>

<ul>

<li>John Q. Public</li>

<li>1234 Main Street</li>

<li>Anytown</li>

<li>USA</li>

</ul>

</font>

Fig. 2

<mailinglist>

<intro>This is some information:</intro>

<name>John Q. Public</name>

<address>1234 Main Street</address>

<city>Anytown</city>

<country>USA</country>

</mailinglist>

Fig. 3

<!doctype mailinglist [

<!element mailinglist (intro?,

name, address, city, country)>

<!element intro (#PCDATA)>

<!element name (#PCDATA)>

<!element address (#PCDATA)>

<!element city (#PCDATA)>

<!element country (#PCDATA)>

Email
Print
Reprint
Learn RSS

Talkback

We would love your feedback!

Post a comment

» VIEW ALL TALKBACK THREADS

Related Content

Related Content

 

By This Author

Sponsored Links

 

Advertisement
SPONSORED LINKS

More Content

  • Blogs
  • Discussions
  • Webcasts
  • Podcasts
  • Videos

Blogs

  • Peter Welander
    Pillar to Post: Peter Welander's Blog

    January 7, 2009
    Is nothing growing in manufacturing?
    Manufacturing is down. You might have already heard something to this effect, but the extent of the economic downturn in December may surprise you.......
    More
  • Peter Welander
    Pillar to Post: Peter Welander's Blog

    January 6, 2009
    Ted Turner calls for clean energy
    Last Sunday there was an interesting group of editorials in the Chicago Tribune by names as diverse as Muammar Gaddafi and George McGover......
    More
  • View All BlogsRSS

Webcasts

Engineering-driven Ethernet
This Control Engineering Roundtable Webcast will address the engineering issues you should be aware of when exploring the adoption of Ethernet or when looking to expand its use in your facility.

Bridging gaps with wireless
Discover how you can create stronger, flexible and cost-effective wireless connections for your entire plant. Register today!

View All Webcasts

Podcasts

Integration platforms such as EDDL and FDT/DTM help integrate increasingly intelligent instrumentation and devices with control and asset management systems. This audio tutorial helps you understand how these platforms operate and what they can do.
Integration Platforms: EDDL and FDT/DTM
Integration platforms such as EDDL and FDT/DTM help integrate increasingly intelligent instrumentation and devices with control and asset management systems. This audio tutorial helps you understand how these platforms operate and what they can do. Hear It Now

View All Podcasts Subscribe Now to Process Control & Instrumentation and never miss an episode
Advertisements





NEWSLETTERS

Get engineering industry news, trends, and business-critical information delivered directly to your inbox!

Click on a title below to learn more.

Weekly News (Weekly)
Process Instrumentation & Sensors (Monthly)
System Integration Monthly (Monthly)
Process & Advanced Control (Monthly)
Machine Control (Monthly)
Information Control (Monthly)
Automation Control (Monthly)
Product Review (Monthly)
Sustainable Engineering (Monthly)
Simplified Safety
Fieldbus Facts
PROFInews North American Edition
About Us   |   Advertising Info   |   Site Map   |   Contact Us   |   Useful Sites   |   FREE Subscription   |   RSS
© 2009 Reed Business Information, a division of Reed Elsevier Inc. All rights reserved.
Use of this Web site is subject to its Terms of Use | Privacy Policy
Please visit these other Reed Business sites