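To select elements by XPath in HtmlUnit, add the Maven dependencies to pom.xml, create a WebClient, load the page, and query it with getByXPath: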
<dependencies>
    <dependency>
        <groupId>net.sourceforge.htmlunit</groupId>
        <artifactId>htmlunit</artifactId>
        <version>2.56.0</version>
    </dependency>
    <dependency>
        <groupId>xalan</groupId>
        <artifactId>xalan</artifactId>
        <version>2.7.2</version>
    </dependency>
</dependencies>
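import java.util.List;
import com.gargoylesoftware.htmlunit.BrowserVersion;
import com.gargoylesoftware.htmlunit.WebClient;
import com.gargoylesoftware.htmlunit.html.DomNode;
import com.gargoylesoftware.htmlunit.html.HtmlPage;

// Emulate Firefox (BrowserVersion constant names vary by HtmlUnit release).
WebClient webClient = new WebClient(BrowserVersion.FIREFOX_78);
// Run the page's JavaScript, but do not abort when a script fails.
webClient.getOptions().setJavaScriptEnabled(true);
webClient.getOptions().setThrowExceptionOnScriptError(false);
String url = "https://example.com";
HtmlPage page = webClient.getPage(url); // throws IOException on network errors
// Select every <p> element in the document and print its text.
List<DomNode> paragraphs = page.getByXPath("//p");
for (DomNode paragraph : paragraphs) {
    System.out.println(paragraph.asText());
}
webClient.close(); // release the client's resources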
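This retrieves the matching elements by XPath in HtmlUnit. Make sure the XPath expression is correct: an expression that matches nothing returns an empty list, so a mistake in the expression shows up as missing elements, not as an obvious failure.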
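One way to check an expression before using it against a live page is to try it on a small sample first. The sketch below does not use HtmlUnit. It uses the JDK's built-in javax.xml.xpath API instead, and the class name XPathCheck, the method countMatches, and the sample markup are all hypothetical names chosen for illustration. The JDK parser requires well-formed XML, whereas HtmlUnit parses real-world HTML leniently, so treat this only as a rough check of the expression itself.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class XPathCheck {
    // Hypothetical sample markup; it must be well-formed XML for the JDK parser.
    static final String SAMPLE =
        "<html><body><p>first</p><div><p>second</p></div></body></html>";

    /** Returns how many nodes the expression matches in SAMPLE, or -1 if evaluation fails. */
    static int countMatches(String expression) {
        XPath xpath = XPathFactory.newInstance().newXPath();
        try {
            NodeList nodes = (NodeList) xpath.evaluate(
                expression,
                new InputSource(new StringReader(SAMPLE)),
                XPathConstants.NODESET);
            return nodes.getLength();
        } catch (XPathExpressionException e) {
            // The expression could not be compiled or evaluated.
            return -1;
        }
    }

    public static void main(String[] args) {
        System.out.println(countMatches("//p"));    // the sample contains two <p> elements
        System.out.println(countMatches("//span")); // no <span> in the sample
    }
}
```

A count of 0 for an expression that should match something points to a problem in the expression, which is the same symptom as an empty list from getByXPath.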