Understanding the document outlining algorithm can be a challenge,
but the rewards are well worth it. No longer will you agonize over
whether to use a
So, let’s start with a sample outline. Imagine you have built a website for a horse breeder, and he wants a page to advertise horses that he is selling. The structure of the page might look something like this:
That’s all it is: a nice, clean, easy-to-follow list of headings, displayed in a hierarchy — much like a table of contents.
To make things even simpler, only two things in your mark-up affect the outline of a Web page:
Figure 2: Our “Horses for sale” page, marked up using headings.
It’s as simple as that. The outline in figure 1 is created by the levels of the headings.
Just so you know that I’m not making this up, you should copy and paste the code above into Geoffrey Sneddon’s excellent outlining tool. Click the big “Outline this” button, et voila!
An outline created with heading content this way is said to consist of implicit, or implied, sections. Each heading creates its own implicit section, and any subsequent heading of a lower level starts another layer, of implicit sub-section, within it.
An implicit section is ended by a heading of the same level or higher. In our example, the “Mares” section is ended by the beginning of the “Stallions” section, and each section that contains details of an individual horse is ended by the beginning of the next one.
Figure 3 below is an example of an implicit section that ends with a heading of the same level. And figure 4 is an implicit section that ends with a heading of a higher level.
Figure 3: An implicit section being closed by a heading of the same level
Figure 4: An implicit section being closed by a heading of a higher level.
Figure 5: The horses page, marked up with some new HTML5 structural elements.
Now, I know what you’re thinking, but I haven’t taken leave of my
senses with these crazy headings. I am making a very important point,
which is that the outline is created by the sectioning content, not the headings.
Go ahead and copy and paste that code into the outliner, and you will see that the heading levels have absolutely no effect on the outline where sectioning content is used.
The
One of the most talked about features of HTML5 is that multiple
The part of the HTML5 spec that deals with headings and sections makes this clear:
This means that user agents that haven’t implemented the outlining algorithm can use implicit sectioning, and those that have implemented it can effectively ignore the heading levels and use sectioning content to create the outline.
At the time of this writing, no browsers or screen readers have implemented the outlining algorithm, which is why we need third-party testing tools such as the outliner. The latest versions of Chrome and Firefox style
When most user agents finally do support it, using an
Figure 6: Our horses page, marked up sensibly.
One other point worth noting here is the position of the paragraph
“All our horses come with full paperwork and a family tree.” In the
example that used headings to create the outline (figure 2), this
paragraph is part of the implicit section created by the “Brown Biscuit”
heading. Human readers will clearly see that this text applies to the
whole document, not just Brown Biscuit.
Sectioning content solves this problem quite easily, moving it back up to the top level, headed by “Horses for sale.”
And it creates a sensible hierarchical outline:
However, if you hope to achieve the same outline by nesting an explicit section inside an implicit section, it won’t work. The sectioning element will simply close the implicit section created by the heading and create a very different outline, as shown below:
This would produce the following outline:
There is no way to make the explicit sections created by the
You can use headings to split up the content of sectioning elements, but not the other way round.
There is no requirement to use headings for
Figure 9: An untitled
The
Untitled
Now, the spec doesn’t actually require
Where the
If you are unsure whether your untitled section is a
The reason for this is sectioning root. As the spec says, sectioning elements create sub-sections of their nearest ancestor sectioning root or sectioning content.
The
In practice, this means that headings inside any of the five sectioning root elements listed above do not affect the outline of the document that they are a part of.
The final thing (you’ll be glad to hear) that I’ll say about sectioning root is that the first heading in the document that is not inside sectioning content is considered to be the document title.
Try the following code in the outliner to see what happens:
Figure 10: How heading levels at the root level affect the outline.
I won’t try to explain this to you because it will probably only
confuse both of us, so I’ll let you play with it in the outliner. Hint:
try using different heading levels for the implicit sections to see how
the outline is affected; for example,
Roger Johansson addresses this issue in his excellent article on document outlines and HTML5 and the follow-up article.
Johansson asks how a proper document outline is supposed to be created for a blog post or other news-type item using HTML5. If you subscribe to the belief that your logo or website name should not be in an
The document is untitled. Somewhat reluctantly, Johansson settles on marking up the website’s title in
This same approach is also widely used on static pages that are built with HTML5 structural elements, and it could be very useful indeed for screen reader users. Imagine that you are using a screen reader to find a decent recipe for chicken pie, and you have a handful of recipe websites open for comparison. Being able to quickly find out which website you are on using the shortcut key for headings would be much more useful than seeing only “chicken pie” on each one.
Not too far behind two top-level headings in the screen reader user survey was one top-level heading for the document. This is probably my preferred option in most cases; but as we have already seen, it creates an untitled body, which is undesirable.
In my opinion, there is an easy way around this problem: don’t use
Remember, you can still use div!
It has been and continues to be the subject of controversy, and its inclusion in the specification is by no means a given. However, for now, it does exactly what it says on the tin: it groups headings into one, as far as the outlining algorithm is concerned.
section
or div
element — you will know straight away. Moreover, you will know why these elements are used, and this knowledge of semantics is the biggest benefit of learning how the algorithm works.
(Smashing’s note: Subscribe to the Smashing eBook Library and get immediate unlimited access
to all Smashing eBooks, released in the past and in the future,
including digital versions of our printed books. At least 24 quality
eBooks a year, 60 eBooks during the first year. Subscribe today!)
What Is The Document Outlining Algorithm?
The document outlining algorithm is a mechanism for producing outline summaries of Web pages based on how they are marked up. Every Web page has an outline, and checking it is easy using a really simple free online tool, which we’ll cover shortly.So, let’s start with a sample outline. Imagine you have built a website for a horse breeder, and he wants a page to advertise horses that he is selling. The structure of the page might look something like this:
That’s all it is: a nice, clean, easy-to-follow list of headings, displayed in a hierarchy — much like a table of contents.
To make things even simpler, only two things in your mark-up affect the outline of a Web page:
- heading content (
h1
toh6
andhgroup
), - sectioning content (
section
,article
,aside
andnav
).
Creating Outlines With Heading Content
To create a structure for the horses page outlined in figure 1, we could use mark-up like the following:01 | < div > |
02 | < h1 >Horses for sale</ h1 > |
03 |
04 | < h2 >Mares</ h2 > |
05 |
06 | < h3 >Pink Diva</ h3 > |
07 | < p >Pink Diva has given birth to three Grand National winners.</ p > |
08 |
09 | < h3 >Ring a Rosies</ h3 > |
10 | < p >Ring a Rosies has won the Derby three times.</ p > |
11 |
12 | < h3 >Chelsea’s Fancy</ h3 > |
13 | < p >Chelsea’s Fancy has given birth to three Gold Cup winners.</ p > |
14 |
15 | < h2 >Stallions</ h2 > |
16 |
17 | < h3 >Korah’s Fury</ h3 > |
18 | < p >Korah’s Fury has fathered three champion race horses.</ p > |
19 |
20 | < h3 >Sea Pioneer</ h3 > |
21 | < p >Sea Pioneer has won The Oaks three times.</ p > |
22 |
23 | < h3 >Brown Biscuit</ h3 > |
24 | < p >Brown Biscuit has fathered nothing of any note.</ p > |
25 |
26 | < p >All our horses come with full paperwork and a family tree.</ p > |
27 | </ div > |
Just so you know that I’m not making this up, you should copy and paste the code above into Geoffrey Sneddon’s excellent outlining tool. Click the big “Outline this” button, et voila!
An outline created with heading content this way is said to consist of implicit, or implied, sections. Each heading creates its own implicit section, and any subsequent heading of a lower level starts another layer, of implicit sub-section, within it.
An implicit section is ended by a heading of the same level or higher. In our example, the “Mares” section is ended by the beginning of the “Stallions” section, and each section that contains details of an individual horse is ended by the beginning of the next one.
Figure 3 below is an example of an implicit section that ends with a heading of the same level. And figure 4 is an implicit section that ends with a heading of a higher level.
1 | < h3 >Sea Pioneer</ h3 >
|
2 | < p >Sea Pioneer has won The Oaks three times.</ p > |
3 | |
4 | < h3 >Brown Biscuit</ h3 >
|
1 | < h3 >Chelsea’s Fancy</ h3 >
|
2 | < p >Chelsea’s Fancy has given birth to 3 Gold Cup winners.</ p > |
3 | |
4 | < h2 >Stallions</ h2 >
|
Creating Outlines With Sectioning Content
Now that we know how heading content works in creating an outline, let’s mark up our horses page using some new HTML5 structural elements:01 | < div > |
02 | < h6 >Horses for sale</ h6 > |
03 | |
04 | < section > |
05 | < h1 >Mares</ h1 > |
06 |
07 | < article > |
08 | < h1 >Pink Diva</ h1 > |
09 | < p >Pink Diva has given birth to three Grand National winners.</ p > |
10 | </ article > |
11 | |
12 | < article > |
13 | < h5 >Ring a Rosies</ h5 > |
14 | < p >Ring a Rosies has won the Derby three times.</ p > |
15 | </ article > |
16 | |
17 | < article > |
18 | < h2 >Chelsea’s Fancy</ h2 > |
19 | < p >Chelsea’s Fancy has given birth to three Gold Cup winners.</ p > |
20 | </ article > |
21 | </ section > |
22 |
23 | < section > |
24 | < h6 >Stallions</ h6 > |
25 |
26 | < article > |
27 | < h3 >Korah’s Fury</ h3 > |
28 | < p >Korah’s Fury has fathered three champion race horses.</ p > |
29 | </ article > |
30 | |
31 | < article > |
32 | < h3 >Sea Pioneer</ h3 > |
33 | < p >Sea Pioneer has won The Oaks three times.</ p > |
34 | </ article > |
35 |
36 | < article > |
37 | < h1 >Brown Biscuit</ h1 > |
38 | < p >Brown Biscuit has fathered nothing of any note.</ p > |
39 | </ article > |
40 | </ section > |
41 |
42 | < p >All our horses come with full paperwork and a family tree.</ p > |
43 | </ div > |
Go ahead and copy and paste that code into the outliner, and you will see that the heading levels have absolutely no effect on the outline where sectioning content is used.
The
section
, article
, aside
and nav
elements are what create the outline, and this time the sections are called explicit sections.One of the most talked about features of HTML5 is that multiple
h1
elements are allowed, and this is why. It’s not an open invitation to mark up every heading on the page as h1
; rather, it’s an acknowledgement that where sectioning content is used, it creates the outline, and that each explicit section has its own heading structure.The part of the HTML5 spec that deals with headings and sections makes this clear:
Sections may contain headings of any rank, but authors are strongly encouraged to either use onlyI would strongly advise that until browsers — and, more critically, screen readers — understand that sectioning content introduces a sub-section, using multipleh1
elements, or to use elements of the appropriate rank for the section’s nesting level.
h1
elements is less safe than
using a heading structure that reflects the level of each heading in the
document, as shown in figure 6 below.This means that user agents that haven’t implemented the outlining algorithm can use implicit sectioning, and those that have implemented it can effectively ignore the heading levels and use sectioning content to create the outline.
At the time of this writing, no browsers or screen readers have implemented the outlining algorithm, which is why we need third-party testing tools such as the outliner. The latest versions of Chrome and Firefox style
h1
elements in nested sections differently, but that is very different from actually implementing the algorithm.When most user agents finally do support it, using an
h1
in every explicit section will be the preferred option. It will allow
syndication tools to handle articles without needing to reformat any
heading levels in the original content.01 | < div > |
02 | < h1 >Horses for sale</ h1 > |
03 |
04 | < section > |
05 | < h2 >Mares</ h2 > |
06 |
07 | < article > |
08 | < h3 >Pink Diva</ h3 > |
09 | < p >Pink Diva has given birth to three Grand National winners.</ p > |
10 | </ article > |
11 |
12 | < article > |
13 | < h3 >Ring a Rosies</ h3 > |
14 | < p >Ring a Rosies has won the Derby three times.</ p > |
15 | </ article > |
16 |
17 | < article > |
18 | < h3 >Chelsea’s Fancy</ h3 > |
19 | < p >Chelsea’s Fancy has given birth to three Gold Cup winners.</ p > |
20 | </ article > |
21 | </ section > |
22 |
23 | < section > |
24 | < h2 >Stallions</ h2 > |
25 |
26 | < article > |
27 | < h3 >Korah’s Fury</ h3 > |
28 | < p >Korah’s Fury has fathered three champion race horses.</ p > |
29 | </ article > |
30 |
31 | < article > |
32 | < h3 >Sea Pioneer</ h3 > |
33 | < p >Sea Pioneer has won The Oaks three times.</ p > |
34 | </ article > |
35 |
36 | < article > |
37 | < h3 >Brown Biscuit</ h3 > |
38 | < p >Brown Biscuit has fathered nothing of any note.</ p > |
39 | </ article > |
40 | </ section > |
41 |
42 | < p >All our horses come with full paperwork and a family tree.</ p > |
43 | </ div > |
Sectioning content solves this problem quite easily, moving it back up to the top level, headed by “Horses for sale.”
Mixing It Up
So, what happens when implicit sections and explicit sections are combined? As long as you remember that implicit sections can go inside explicit sections, but not the other way round, you will be fine. For example, the following works well and is perfectly valid:01 | < h1 >Horses for sale</ h1 > |
02 |
03 | < section > |
04 | < h2 >Mares</ h2 > |
05 |
06 | < h3 >Pink Diva</ h3 > |
07 | < p >Pink Diva has given birth to three Grand National winners.</ p > |
08 |
09 | < h3 >Ring a Rosies</ h3 > |
10 | < p >Ring a Rosies has won the Derby three times.</ p > |
11 |
12 | < h3 >Chelsea’s Fancy</ h3 > |
13 | < p >Chelsea’s Fancy has given birth to three Gold Cup winners.</ p > |
14 | </ section > |
However, if you hope to achieve the same outline by nesting an explicit section inside an implicit section, it won’t work. The sectioning element will simply close the implicit section created by the heading and create a very different outline, as shown below:
01 | < h1 >Horses for sale</ h1 > |
02 |
03 | < h2 >Mares</ h2 > |
04 |
05 | < article > |
06 | < h3 >Pink Diva</ h3 > |
07 | < p >Pink Diva has given birth to three Grand National winners.</ p > |
08 | </ article > |
09 |
10 | < article > |
11 | < h3 >Ring a Rosies</ h3 > |
12 | < p >Ring a Rosies has won the Derby three times.</ p > |
13 | </ article > |
14 |
15 | < article > |
16 | < h3 >Chelsea’s Fancy</ h3 > |
17 | < p >Chelsea’s Fancy has given birth to three Gold Cup winners.</ p > |
18 | </ article > |
There is no way to make the explicit sections created by the
article
elements become sub-sections of the Mare’s implicit section.You can use headings to split up the content of sectioning elements, but not the other way round.
Things To Watch Out For
Untitled Sections
Until now we haven’t really looked atnav
and aside
, but they work exactly the same as section
and article
.
If you have secondary content that is generally related to your
website — say, horse-training tips and industry news — you would mark it
up as an aside
, which creates an explicit section in the document outline. Similarly, major navigation would be marked up as nav
, again creating an explicit section.There is no requirement to use headings for
aside
and nav
, so they can appear in the outline as untitled sections. Go ahead and try the following code in the outliner:01 | < nav > |
02 | < ul > |
03 | < li >< a href = "/" >home</ a ></ li > |
04 | < li >< a href = "/about.html" >about us</ a ></ li > |
05 | < li >< a href = "/horses.html" >horses for sale</ a ></ li > |
06 | </ ul > |
07 | </ nav > |
08 |
09 | < h1 >Horses for sale</ h1 > |
10 |
11 | < section > |
12 | < h2 >Mares</ h2 > |
13 | </ section > |
14 |
15 | < section > |
16 | < h2 >Stallions</ h2 > |
17 | </ section > |
nav
appears as an untitled section. Now, this
generally wouldn’t be a problem and is not considered bad HTML5 code,
although in his recent HTML5 Doctor article on outlining, Mike Robinson recommends using headings for all sectioning content in order to increase accessibility.Untitled
section
and article
elements, on the other hand, are generally to be avoided. In fact, if you’re unsure whether to use a section
or article
,
a good rule of thumb is to see whether the content has a natural,
logical heading. If it doesn’t, then you will more than likely be wiser
to use a good old div
.Now, the spec doesn’t actually require
section
elements to have a title. It says:The section element represents a generic section of a document or application. A section, in this context, is a thematic grouping of content, typically with a heading.Your interpretation of this probably hinges on your understanding of the word “typically.” I take it to mean that you need a damn good reason not to use headings with
section
elements. I do not take it to mean that you can ignore it whenever you feel the urge to use a new HTML5 element.Where the
article
element is specified, the spec goes even further by showing an example of blog comments marked up as untitled article
s, so there are exceptions. However, if you see an untitled section
or article
in the outline, make sure you have a good reason for not giving it a title.If you are unsure whether your untitled section is a
nav
, aside
, section
or article
, a very handy Opera extension
will let you know which type of sectioning content you have left
untitled. The tool will also let you view the outline without leaving
the page, which can be hugely beneficial when you’re debugging sections.Sectioning Root
The eagle-eyed among you will have noticed that when I said that sectioning content cannot create a sub-section of an implicit section, there was anh1
(“Horses for sale”) not in sectioning content immediately followed by a section
(“Mares”), and that the sectioning content did actually create a sub-section of the h1
.The reason for this is sectioning root. As the spec says, sectioning elements create sub-sections of their nearest ancestor sectioning root or sectioning content.
Sectioning content elements are always considered subsections of their nearest ancestor sectioning root or their nearest ancestor element of sectioning content, whichever is nearest, regardless of what implied sections other headings may have created.The
body
element is sectioning root. So, if you paste the code from figure 7 into the outliner, the h1
would be the sectioning root heading, and the section
element would be a sub-section of the body
sectioning root.The
body
element is not the only one that acts as sectioning root. There are five others:blockquote
details
fieldset
figure
td
In practice, this means that headings inside any of the five sectioning root elements listed above do not affect the outline of the document that they are a part of.
The final thing (you’ll be glad to hear) that I’ll say about sectioning root is that the first heading in the document that is not inside sectioning content is considered to be the document title.
Try the following code in the outliner to see what happens:
1 | < section > |
2 | < h1 >this is an h1</ h1 > |
3 | </ section > |
4 |
5 | < h6 >this h6 comes first in the source</ h6 > |
6 |
7 | < h1 >this h1 comes last in the source</ h1 > |
h3
and h4
, or two h5
s.Untitled Documents
If no heading is at the root level of the document (i.e. not inside sectioning content), then the document itself will be untitled. This is a pretty serious problem, and it can occur either through carelessness or, paradoxically, by thinking carefully about how sectioning content should be used.Roger Johansson addresses this issue in his excellent article on document outlines and HTML5 and the follow-up article.
Johansson asks how a proper document outline is supposed to be created for a blog post or other news-type item using HTML5. If you subscribe to the belief that your logo or website name should not be in an
h1
element, you could mark up your blog post along the lines of the following:1 | < body > |
2 | < article > |
3 | < h1 >Blog post title</ h1 > |
4 |
5 | < p >Blog post content</ p > |
6 | </ article > |
7 | </ body > |
h1
and using another h1
to mark up the article’s title. This is a sensible solution and is backed up by the results of the WebAIM screenreader user survey, in which the majority of respondents stated a preference for two top-level headings in exactly this format.This same approach is also widely used on static pages that are built with HTML5 structural elements, and it could be very useful indeed for screen reader users. Imagine that you are using a screen reader to find a decent recipe for chicken pie, and you have a handful of recipe websites open for comparison. Being able to quickly find out which website you are on using the shortcut key for headings would be much more useful than seeing only “chicken pie” on each one.
Not too far behind two top-level headings in the screen reader user survey was one top-level heading for the document. This is probably my preferred option in most cases; but as we have already seen, it creates an untitled body, which is undesirable.
In my opinion, there is an easy way around this problem: don’t use
article
as a wrapper for single-blog posts, news items or static page main content. Remember that article
is sectioning content: it creates a sub-section of the document. But in
these cases, the document is the content, and the content is the
document. Setting aside the name of the element, why would we want to
create a sub-section of a document before it has even begun?Remember, you can still use div!
hgroup
This is the final item in the list of things to watch out for, and it’s very easy to understand. Thehgroup
element can contain only headings (h1
to h6
), and its purpose is to remove all but the highest-level heading it contains from the outline.It has been and continues to be the subject of controversy, and its inclusion in the specification is by no means a given. However, for now, it does exactly what it says on the tin: it groups headings into one, as far as the outlining algorithm is concerned.
HTML5 And The Document Outlining Algorithm
Reviewed by JohnBlogger
on
6:09 PM
Rating:
No comments: