DOM(Document Object Model)解析入門

原創

zmjiangi

2020-02-21 23:02

2014年12月31日08:36:29 天氣晴

首先我們學習一下基本的知識

一DOM樹節點概念

DOM將XML文檔作爲一個樹形結構，而樹葉被定義爲爲節點
根據DOM,XML文檔中的每個成分都是一個節點
DOM是這樣規定的：
1、整個文檔是一個文檔節點（Document Node）
2、每個XML的標籤是一個元素節點（Element Node）
3、包含在XML元素中的文本是文本節點（Text Node）
4、每一個XML屬性是一個屬性節點（Attirbute Node）
5、註釋屬於註釋節點（Annotation Node）

二DOM解析編程過程步驟：dom4j下載地址

(下載的dom4j.jar中沒有SAXReader這個類...)

A.創建一個解析器工廠對象DocumentBuliderFactory

B.創建一個解析器對象DocumnetBuilder

C.創建一個文檔對象Document(org.w3c.dom.Documnet),到此在內存裏就會形成與該XML文檔對象的一個“樹模型”

D.調用相關的API對該文檔對象進行操作

注：DOM解析技術來自org.w3c.dom

student.xml

<?xml version="1.0" encoding="UTF-8"?>
<students>
	<student id="studentId">
		<name>LGP</name>
		<age>24</age>
	</student>
</students>

ReadXmlByDOM.java

package com.gditc.test;

import java.io.File;
import java.io.IOException;

import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;

import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;


public class ReadXmlByDOM {
	public static void main(String[] args) {
		 readXmlFileByDom("xml/student.xml");
	}

	private static void readXmlFileByDom(String xmlFile)  {
		try {
			//創建一個解析器工廠
			DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
			
			//創建一個解析器對象
			DocumentBuilder db = dbf.newDocumentBuilder();
			
			//創建一個文檔對象，到此，在內存裏就會形成與該XML文檔對象的一個"樹模型"
			Document document = db.parse(new File(xmlFile));
			
			System.out.println("【獲得文檔樹基本信息如下】：");
			System.out.println("xml版本"+document.getXmlVersion());
			System.out.println("xml編碼"+document.getXmlEncoding());
			System.out.println("是否實體引用"+document.getXmlStandalone());
			System.out.println("【獲得學生信息如下】：");
			NodeList nodeList = document.getElementsByTagName("student");
			
			//理解後寫的,這是已知道xml的結構下做的遍歷
			for (int i=0; i<nodeList.getLength(); i++) {
				Node node = nodeList.item(i);
				NodeList nodeList2 = node.getChildNodes();
				System.out.println(nodeList2.getLength());
				for (int j=0; j<nodeList2.getLength(); j++) {
					Node node2 = nodeList2.item(j);
					if (node2 instanceof Element) {
						Element e = (Element) node2;
						System.out.println(e.getNodeName()+":"+e.getTextContent());
					}
				}
			}
			
			/*//參考
			for (int i=0; i<nodeList.getLength(); i++) {
				//studentNode是student節點
				Node studentNode = nodeList.item(i);
				
				if (studentNode instanceof Element) {
					Element studentElement = (Element) studentNode;
					String attrId = studentElement.getAttribute("id");
					System.out.println("id屬性值"+attrId);
					
					//subNodeList代表Student節點下的所有子節點列表
					NodeList subNodeList = studentElement.getChildNodes();
					System.out.println("【學生信息如下】：");
					for (int j=0; j<subNodeList.getLength(); j++) {
						Node subStudentNode = subNodeList.item(j);
						
						//由於xml文檔可以存在空文本節點，因此，需要對此情況做處理
						if (subStudentNode instanceof Element) {
							Element e = (Element) subStudentNode;
							System.out.println(e.getTagName()+":"+e.getTextContent());
						}
					}
				}
			}*/
			
		} catch (ParserConfigurationException e) {
			e.printStackTrace();
		} catch (SAXException e) {
			// TODO Auto-generated catch block
			e.printStackTrace();
		} catch (IOException e) {
			// TODO Auto-generated catch block
			e.printStackTrace();
		}
	}
}

注意：node2 instanceof Element這個判斷條件是必須有的，我們需要獲取元素（標籤）中的文本，對於非標籤中的空白文本是不需要取得

zmjiangi

發佈了32 篇原創文章 · 獲贊 8 · 訪問量 7萬+

私信關注

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

DOM(Document Object Model)解析入門

一DOM樹節點概念

二DOM解析編程過程步驟：dom4j下載地址

python gdal 安裝使用（Windows， python 3.6.8）

HTML Entities

整合三大框架出現的一些常見的問題及解決的方案

使用JackSon

javascript11位手機號碼正則表達式

bootstrap2.3 與 bootstrap3.3 modal嵌入表單的樣式改變

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結

DOM(Document Object Model)解析 入門

一DOM樹節點概念

二DOM解析編程過程步驟：dom4j下載地址

DOM(Document Object Model)解析入門