【推薦】oc解析HTML數據的類庫(爬取網頁數據)

来源:http://www.cnblogs.com/salam/archive/2016/03/15/5279783.html
-Advertisement-
Play Games

TFhpple是一個用於解析html數據的第三方庫,本人感覺功能還算可以,只不過在使用前必須配置項目。 配置 1.導入libxml2.tbd   2.設置編譯路徑   使用 這裡使用一個例子來說明 http://so.gushiwen.org/guwen/book_2.aspx    1.創建TFH


  TFhpple是一個用於解析html數據的第三方庫,本人感覺功能還算可以,只不過在使用前必須配置項目。

  

  配置

1.導入libxml2.tbd

 

2.設置編譯路徑

   使用

這裡使用一個例子來說明

http://so.gushiwen.org/guwen/book_2.aspx

 

 1.創建TFHpple對象,data為網站返回的數據

TFHpple *htmlParser = [[TFHpple alloc] initWithHTMLData:data];

 

 2.使用searchWithXPathQuery方法得到有用數據,XPATH知識具體百度

NSArray *temp1 = [htmlParser searchWithXPathQuery:@"//div[@class='shileft']/div[@class='bookcont']"]

這樣我們獲取了論語的數據

 

3。獲取並分析元素

TFHppleElement *element = [elements objectAtIndex:i];

 TFHppleElement對象包含許多屬性,下麵簡單介紹一下各屬性

1。

@property (nonatomic, copy, readonly) NSString *raw

 raw是包含html標記的網頁數據

<div class="bookcont">&#13;
        <ul>&#13;
         &#13;
              <span><a href="/guwen/bookv_19.aspx">學而篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_20.aspx">為政篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_21.aspx">八佾篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_22.aspx">里仁篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_23.aspx">公冶長篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_24.aspx">雍也篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_25.aspx">述而篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_26.aspx">泰伯篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_27.aspx">子罕篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_28.aspx">鄉黨篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_29.aspx">先進篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_30.aspx">顏淵篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_31.aspx">子路篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_32.aspx">憲問篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_33.aspx">衛靈公篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_34.aspx">季氏篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_35.aspx">陽貨篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_36.aspx">微子篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_37.aspx">子張篇</a></span>&#13;
               &#13;
              <span><a href="/guwen/bookv_38.aspx">堯曰篇</a></span>&#13;
              &#13;
        </ul>&#13;
        </div>
raw數據

 

 

2.content是網頁的具體數據,不包含html標記

學而篇
               
              為政篇
               
              八佾篇
               
              里仁篇
               
              公冶長篇
               
              雍也篇
               
              述而篇
               
              泰伯篇
               
              子罕篇
               
              鄉黨篇
               
              先進篇
               
              顏淵篇
               
              子路篇
               
              憲問篇
               
              衛靈公篇
               
              季氏篇
               
              陽貨篇
               
              微子篇
               
              子張篇
               
              堯曰篇
content數據

 

 

3.tagName是html標簽

輸出只有div

 

4.attributes,屬性。。。。。。。

class = bookcont;

 

 

5.children子節點

(
    "{\n    nodeContent = \"
\\n        \";\n    nodeName = text;\n}",
    "{\n    nodeChildArray =     (\n                {\n            nodeContent = \"
\\n         
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_19.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b66\\U800c\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b66\\U800c\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_19.aspx\\\">\\U5b66\\U800c\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b66\\U800c\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_19.aspx\\\">\\U5b66\\U800c\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_20.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U4e3a\\U653f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U4e3a\\U653f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_20.aspx\\\">\\U4e3a\\U653f\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U4e3a\\U653f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_20.aspx\\\">\\U4e3a\\U653f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_21.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U516b\\U4f7e\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U516b\\U4f7e\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_21.aspx\\\">\\U516b\\U4f7e\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U516b\\U4f7e\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_21.aspx\\\">\\U516b\\U4f7e\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_22.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U91cc\\U4ec1\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U91cc\\U4ec1\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_22.aspx\\\">\\U91cc\\U4ec1\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U91cc\\U4ec1\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_22.aspx\\\">\\U91cc\\U4ec1\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_23.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U516c\\U51b6\\U957f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U516c\\U51b6\\U957f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_23.aspx\\\">\\U516c\\U51b6\\U957f\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U516c\\U51b6\\U957f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_23.aspx\\\">\\U516c\\U51b6\\U957f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_24.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U96cd\\U4e5f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U96cd\\U4e5f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_24.aspx\\\">\\U96cd\\U4e5f\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U96cd\\U4e5f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_24.aspx\\\">\\U96cd\\U4e5f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_25.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U8ff0\\U800c\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U8ff0\\U800c\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_25.aspx\\\">\\U8ff0\\U800c\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U8ff0\\U800c\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_25.aspx\\\">\\U8ff0\\U800c\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_26.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U6cf0\\U4f2f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U6cf0\\U4f2f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_26.aspx\\\">\\U6cf0\\U4f2f\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U6cf0\\U4f2f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_26.aspx\\\">\\U6cf0\\U4f2f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_27.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b50\\U7f55\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b50\\U7f55\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_27.aspx\\\">\\U5b50\\U7f55\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b50\\U7f55\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_27.aspx\\\">\\U5b50\\U7f55\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_28.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U4e61\\U515a\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U4e61\\U515a\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_28.aspx\\\">\\U4e61\\U515a\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U4e61\\U515a\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_28.aspx\\\">\\U4e61\\U515a\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_29.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5148\\U8fdb\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5148\\U8fdb\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_29.aspx\\\">\\U5148\\U8fdb\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5148\\U8fdb\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_29.aspx\\\">\\U5148\\U8fdb\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_30.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U989c\\U6e0a\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U989c\\U6e0a\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_30.aspx\\\">\\U989c\\U6e0a\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U989c\\U6e0a\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_30.aspx\\\">\\U989c\\U6e0a\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_31.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b50\\U8def\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b50\\U8def\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_31.aspx\\\">\\U5b50\\U8def\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b50\\U8def\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_31.aspx\\\">\\U5b50\\U8def\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_32.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5baa\\U95ee\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5baa\\U95ee\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_32.aspx\\\">\\U5baa\\U95ee\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5baa\\U95ee\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_32.aspx\\\">\\U5baa\\U95ee\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_33.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U536b\\U7075\\U516c\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U536b\\U7075\\U516c\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_33.aspx\\\">\\U536b\\U7075\\U516c\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U536b\\U7075\\U516c\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_33.aspx\\\">\\U536b\\U7075\\U516c\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_34.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b63\\U6c0f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b63\\U6c0f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_34.aspx\\\">\\U5b63\\U6c0f\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b63\\U6c0f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_34.aspx\\\">\\U5b63\\U6c0f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_35.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U9633\\U8d27\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U9633\\U8d27\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_35.aspx\\\">\\U9633\\U8d27\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U9633\\U8d27\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_35.aspx\\\">\\U9633\\U8d27\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_36.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5fae\\U5b50\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5fae\\U5b50\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_36.aspx\\\">\\U5fae\\U5b50\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5fae\\U5b50\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_36.aspx\\\">\\U5fae\\U5b50\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_37.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b50\\U5f20\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b50\\U5f20\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_37.aspx\\\">\\U5b50\\U5f20\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b50\\U5f20\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_37.aspx\\\">\\U5b50\\U5f20\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n               
\\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_38.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5c27\\U66f0\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5c27\\U66f0\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_38.aspx\\\">\\U5c27\\U66f0\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5c27\\U66f0\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_38.aspx\\\">\\U5c27\\U66f0\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"
\\n              
\\n        \";\n            nodeName = text;\n        }\n    );\n    nodeContent = \"
\\n         
\\n              \\U5b66\\U800c\\U7bc7
\\n               
\\n              \\U4e3a\\U653f\\U7bc7
\\n               
\\n              \\U516b\\U4f7e\\U7bc7
\\n               
\\n              \\U91cc\\U4ec1\\U7bc7
\\n               
\\n              \\U516c\\U51b6\\U957f\\U7bc7
\\n               
\\n              \\U96cd\\U4e5f\\U7bc7
\\n               
\\n              \\U8ff0\\U800c\\U7bc7
\\n               
\\n              \\U6cf0\\U4f2f\\U7bc7
\\n               
\\n              \\U5b50\\U7f55\\U7bc7
\\n               
\\n              \\U4e61\\U515a\\U7bc7
\\n               
\\n              \\U5148\\U8fdb\\U7bc7
\\n               
\\n              \\U989c\\U6e0a\\U7bc7
\\n               
\\n              \\U5b50\\U8def\\U7bc7
\\n               
\\n              \\U5baa\\U95ee\\U7bc7
\\n               
\\n              \\U536b\\U7075\\U516c\\U7bc7
\\n               
\\n              \\U5b63\\U6c0f\\U7bc7
\\n               
\\n              \\U9633\\U8d27\\U7bc7
\\n               
\\n              \\U5fae\\U5b50\\U7bc7
\\n               
\\n              \\U5b50\\U5f20\\U7bc7
\\n               
\\n              \\U5c27\\U66f0\\U7bc7
\\n              
\\n        \";\n    nodeName = ul;\n    raw = \"<ul>&#13;\\n         &#13;\\n              <span><a href=\\\"/guwen/bookv_19.aspx\\\">\\U5b66\\U800c\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_20.aspx\\\">\\U4e3a\\U653f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_21.aspx\\\">\\U516b\\U4f7e\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_22.aspx\\\">\\U91cc\\U4ec1\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_23.aspx\\\">\\U516c\\U51b6\\U957f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_24.aspx\\\">\\U96cd\\U4e5f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_25.aspx\\\">\\U8ff0\\U800c\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_26.aspx\\\">\\U6cf0\\U4f2f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_27.aspx\\\">\\U5b50\\U7f55\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_28.aspx\\\">\\U4e61\\U515a\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_29.aspx\\\">\\U5148\\U8fdb\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_30.aspx\\\">\\U989c\\U6e0a\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_31.aspx\\\">\\U5b50\\U8def\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_32.aspx\\\">\\U5baa\\U95ee\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_33.aspx\\\">\\U536b\\U7075\\U516c\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_34.aspx\\\">\\U5b63\\U6c0f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_35.aspx\\\">\\U9633\\U8d27\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_36.aspx\\\">\\U5fae\\U5b50\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_37.aspx\\\">\\U5b50\\U5f20\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_38.aspx\\\">\\U5c27\\U66f0\\U7bc7</a></span>&#13;\\n              &#13;\\n        </ul>\";\n}",
    "{\n    nodeContent = \"
\\n        \";\n    nodeName = text;\n}"
)
children

 

 

6.firstChild

{
    nodeContent = "
\n        ";
    nodeName = text;
}

 

上面屬性都是涉及HTML語言的標記,我們一般使用的時content屬性,然後處理得到的NSString對象

 

這樣我們就得到並處理為我們想要的數據。TFHppleElement是一個很重要的類,具體使用在這裡就不介紹了。

 


您的分享是我們最大的動力!

-Advertisement-
Play Games
更多相關文章
  • 第二個界面中的lable顯示第一個界面textField中的文本 首先我們建立一個RootViewControllers和一個DetailViewControllers,在DetailViewControllers中聲明一個textString屬性,用於接收傳過來的字元串, RootViewCont
  • 面向對象編程進階和字元串 Δ一.類的設計模式—單例 【單例】程式允許過程中,有且僅有一塊記憶體空間存在,這種類的設計模式成為單例 【問】什麼時候用到單例 【答】數據共用的時候 1.單例的書寫格式 以+(加號)開頭的方法,可以直接被類調用;-(減號)開頭方法 ,是對象方法,用對象來調用; 2.認識單例
  • 首先創建2個新界面 , 然後創建一個類,如下圖   然後在AppDeleate.h   然後在AppDeleate.m中  然後進入AppStatus.h    AppStatus.m   FirstViewController.h中    FirstViewController.m中   Seco
  • ContentProvider是不同應用程式之間進行數據交換的一種方式或者稱標準API. http://www.qq.com:80/news.jsp ContentProvider要求的;Uri與上面類似   在AndroidManifest.xml中註冊ContentProvider,指定 建立一
  •  
  •   首先:自定義一個漸變視圖,在自定義視圖中重寫drawRect  接下來主要分為5步   步驟還是和上面一樣!     作者: 清澈Saup出處: http://www.cnblogs.com/qingche/本文版權歸作者和博客園共有,歡迎轉載,但必須保留此段聲明,且在文章頁面明顯位置給出原文連
  • 1.NSString類型的屬性為什麼用copy   NSString類型的屬性可以用strong修飾,但會造成一些問題,請看下麵代碼   2016-03-15 10:10:25.772 string[619:12684] 可變字元串修改前nameStrong= zhangsan nameCopy=z
  • 在內嵌彙編中,可以將C語言表達式指定為彙編指令的操作數,而且不用去管如何將C語言表達式的值讀入哪個寄存器,以及如何將計算結果寫回C 變數,你只要告訴程式中C語言表達式與彙編指令操作數之間的對應關係即可, GCC會自動插入代碼完成必要的操作。1、簡單的內嵌彙編例:        __asm__ __v
一周排行
    -Advertisement-
    Play Games
  • 移動開發(一):使用.NET MAUI開發第一個安卓APP 對於工作多年的C#程式員來說,近來想嘗試開發一款安卓APP,考慮了很久最終選擇使用.NET MAUI這個微軟官方的框架來嘗試體驗開發安卓APP,畢竟是使用Visual Studio開發工具,使用起來也比較的順手,結合微軟官方的教程進行了安卓 ...
  • 前言 QuestPDF 是一個開源 .NET 庫,用於生成 PDF 文檔。使用了C# Fluent API方式可簡化開發、減少錯誤並提高工作效率。利用它可以輕鬆生成 PDF 報告、發票、導出文件等。 項目介紹 QuestPDF 是一個革命性的開源 .NET 庫,它徹底改變了我們生成 PDF 文檔的方 ...
  • 項目地址 項目後端地址: https://github.com/ZyPLJ/ZYTteeHole 項目前端頁面地址: ZyPLJ/TreeHoleVue (github.com) https://github.com/ZyPLJ/TreeHoleVue 目前項目測試訪問地址: http://tree ...
  • 話不多說,直接開乾 一.下載 1.官方鏈接下載: https://www.microsoft.com/zh-cn/sql-server/sql-server-downloads 2.在下載目錄中找到下麵這個小的安裝包 SQL2022-SSEI-Dev.exe,運行開始下載SQL server; 二. ...
  • 前言 隨著物聯網(IoT)技術的迅猛發展,MQTT(消息隊列遙測傳輸)協議憑藉其輕量級和高效性,已成為眾多物聯網應用的首選通信標準。 MQTTnet 作為一個高性能的 .NET 開源庫,為 .NET 平臺上的 MQTT 客戶端與伺服器開發提供了強大的支持。 本文將全面介紹 MQTTnet 的核心功能 ...
  • Serilog支持多種接收器用於日誌存儲,增強器用於添加屬性,LogContext管理動態屬性,支持多種輸出格式包括純文本、JSON及ExpressionTemplate。還提供了自定義格式化選項,適用於不同需求。 ...
  • 目錄簡介獲取 HTML 文檔解析 HTML 文檔測試參考文章 簡介 動態內容網站使用 JavaScript 腳本動態檢索和渲染數據,爬取信息時需要模擬瀏覽器行為,否則獲取到的源碼基本是空的。 本文使用的爬取步驟如下: 使用 Selenium 獲取渲染後的 HTML 文檔 使用 HtmlAgility ...
  • 1.前言 什麼是熱更新 游戲或者軟體更新時,無需重新下載客戶端進行安裝,而是在應用程式啟動的情況下,在內部進行資源或者代碼更新 Unity目前常用熱更新解決方案 HybridCLR,Xlua,ILRuntime等 Unity目前常用資源管理解決方案 AssetBundles,Addressable, ...
  • 本文章主要是在C# ASP.NET Core Web API框架實現向手機發送驗證碼簡訊功能。這裡我選擇是一個互億無線簡訊驗證碼平臺,其實像阿裡雲,騰訊雲上面也可以。 首先我們先去 互億無線 https://www.ihuyi.com/api/sms.html 去註冊一個賬號 註冊完成賬號後,它會送 ...
  • 通過以下方式可以高效,並保證數據同步的可靠性 1.API設計 使用RESTful設計,確保API端點明確,並使用適當的HTTP方法(如POST用於創建,PUT用於更新)。 設計清晰的請求和響應模型,以確保客戶端能夠理解預期格式。 2.數據驗證 在伺服器端進行嚴格的數據驗證,確保接收到的數據符合預期格 ...