我有一个复杂的长XHTML文件,其中包含CSS。在google和本网站上搜索时,我发现了一些库,它们可以用于XHTML解析:
但是,我想知道是否有任何iPhone库可以将xhtml + css文档转换为NSAttributedString
(当然只有文本)。
我一直在思考这个问题,我也有一些想法,但我认为它不会很有效率。我的主要想法是通过以下步骤形成的:
id
或class
属性的所有标记,并获取它们有效的字符串范围(我无法实现这一点)。NSDictionary
上,其中包含更多的NSDictionary
对象。就像这样:
mainDict {对象:字典{对象:@"#00ff00“键:@”颜色“对象:@"1em”键:@“字体大小”}键:@“id”对象: anotherDictionary {.}键:@“另一个id”}}NSAttributedString
属性字典上转换这些CSS属性字典。我知道这很复杂,我不需要你提供代码(当然,如果你提供它,那就太好了),我只想要指向库的链接,或者,如果它不存在,我需要一些建议来自己创建一个解析器。
当然,如果你需要更多的信息,可以通过评论来询问。
谢谢你!!
发布于 2012-06-03 02:36:19
这取决于您的需求是否可以满足您的需要,但是DTCoreText有一个HTML -> NSAttributedString转换器。它对于DTCoreText想要/需要做的事情非常具体,但它至少可以为您指明正确的方向。
发布于 2012-06-04 02:14:58
我将HTML解析为NSAttributedString的方法是将解析的节点(及其childNodes)递归地附加到NSMutableAttributedString中。
我还没有准备好在任何地方发布我的完整代码。但希望这能给你一些提示。
NSString+HTML.h
/* - toHTMLElements
* parse the string itself into a dictionary collection of htmlelements for following keys
* : @"attributedString" // html main body
* : @"insets" // images and/or videos with range info
* : @"as" // href with range info
*
*/
- (NSMutableDictionary*) toHTMLElements;
NSString+HTML.m
- (NSMutableDictionary*) toHTMLElements {
// …
// handle escape encoding here
// assume that NSString* htmlString is the processed string;
// …
NSMutableDictionary * htmlElements = [[NSMutableDictionary dictionary] retain];
NSMutableAttributedString * attributedString = [[[NSMutableAttributedString alloc] init] autorelease];
NSMutableArray * insets = [NSMutableArray array];
NSMutableArray * as = [NSMutableArray array];
[htmlElements setObject:attributedString forKey:HTML_ATTRIBUTEDSTRING];
[htmlElements setObject:insets forKey:HTML_INSETS];
[htmlElements setObject:as forKey:HTML_AS];
// parse the HTML with an XML parser
// CXXML is a variance of TBXML (http://www.tbxml.co.uk/ ) which can handle the inline tags such as <span>
// code not available to public yet, so write your own inline-tag-enabled HTML/XML parser.
CXXML * xml = [CXXML tbxmlWithXMLString:htmlString];
TBXMLElement * root = xml.rootXMLElement;
TBXMLElement * next = root->firstChild;
while (next != nil) {
//
// do something here for special treatments if needed
//
NSString * tagName = [CXXML elementName:next];
[self appendXMLElement:next withAttributes:[HTMLElementAttributes defaultAttributesFor:tagName] toHTMLElements:htmlElements];
next = next->nextSibling;
}
return [htmlElements autorelease];
}
- (void) appendXMLElement:(TBXMLElement*)aElement withAttributes:(NSDictionary*)parentAttributes toHTMLElements:(NSMutableDictionary*) htmlElements {
// do your parse of aElement and its attribute values,
// assume NSString * tagAttrString is the parsed html attribute string (either from "style" attribute or css file) for this tag like : width:200px; color:#123456;
// let an external HTMLElementAttributes class to handle the attribute updates from the parent node's attributes
NSDictionary * tagAttr = [HTMLElementAttributes updateAttributes: parentAttributes withCSSAttributes:tagAttrString];
// create your NSAttributedString styled by tagAttr
// create insets such as images / videos or hyper links objects
// then update the htmlElements for storage
// once this tag is handled, recursively visit and process the current tag's children
TBXMLElement * nextChild = aElement->firstChild;
while (nextChild != nil) {
[self appendXMLElement:nextChild withAttributes:tagAttr toHTMLElements:htmlElements];
nextChild = nextChild->nextSibling;
}
}
https://stackoverflow.com/questions/10799164
复制相似问题