What is CDATA in HTML? [duplicate]

All text in an XML document will be parsed by the parser.

But text inside a CDATA section will be ignored by the parser.

CDATA – (Unparsed) Character Data

The term CDATA is used about text data that should not be parsed by the XML parser.

Characters like “<” and “&” are illegal in XML elements.

“<” will generate an error because the parser interprets it as the start of a new element.

“&” will generate an error because the parser interprets it as the start of an character entity.

Some text, like JavaScript code, contains a lot of “<” or “&” characters. To avoid errors script code can be defined as CDATA.

Everything inside a CDATA section is ignored by the parser.

A CDATA section starts with “<![CDATA[” and ends with “]]>

Use of CDATA in program output

CDATA sections in XHTML documents are liable to be parsed differently by web browsers if they render the document as HTML, since HTML parsers do not recognise the CDATA start and end markers, nor do they recognise HTML entity references such as &lt; within <script> tags. This can cause rendering problems in web browsers and can lead to cross-site scripting vulnerabilities if used to display data from untrusted sources, since the two kinds of parsers will disagree on where the CDATA section ends.

A brief SGML tutorial.

Also, see the Wikipedia entry on CDATA.

Leave a Comment