
Question: Java docx4j fails when converting a doc file to PDF: Couldn't get [Content_Types].xml from ZipFile

查拉图斯特拉说
Published 2025-07-22 08:20:02
Included in the column: 后端架构 (Backend Architecture)

Preface

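I ran into a requirement whose first step is converting a document to PDF. The blog posts I went through all say roughly the same thing, but the code still throws when I actually run it, and I can't work out why. I've pasted the error below; does anyone know how to fix it?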

Imported dependencies

            <!-- For DOCX to PDF Conversion using docx4j -->
            <dependency>
                <groupId>org.docx4j</groupId>
                <artifactId>docx4j-core</artifactId>
                <version>8.3.9</version>
            </dependency>
            <dependency>
                <groupId>org.docx4j</groupId>
                <artifactId>docx4j-export-fo</artifactId>
                <version>8.3.9</version>
            </dependency>
            <!--
              docx4j needs an SLF4J logging implementation.
              If your project already has slf4j-log4j12, logback-classic, or another binding, skip this entry.
              Otherwise, you can add the simple implementation below.
            -->
            <dependency>
                <groupId>org.slf4j</groupId>
                <artifactId>slf4j-simple</artifactId>
                <version>1.7.32</version>
                <scope>runtime</scope>
            </dependency>

Code

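import java.io.File;
import java.io.FileNotFoundException;
import java.io.FileOutputStream;
import java.io.OutputStream;

import org.docx4j.Docx4J;
import org.docx4j.openpackaging.packages.WordprocessingMLPackage;

    // Method of the enclosing class. `log` is that class's logger
    // (its declaration is not shown in the post; the {} placeholders suggest SLF4J).

    /**
     * Converts a DOCX file to a PDF file using docx4j.
     * @param docxFile the Word document to convert
     * @return the generated PDF file
     * @throws Exception if the conversion fails
     */
    private File convertDocxToPdf(File docxFile) throws Exception {
        if (docxFile == null || !docxFile.exists()) {
            // Message: "DOCX file to convert was not found: "
            throw new FileNotFoundException("用于转换的DOCX文件未找到: " + docxFile);
        }

        try {
            // 1. Load the DOCX file
            WordprocessingMLPackage wordMLPackage = WordprocessingMLPackage.load(docxFile);

            // 2. Derive the PDF output path (e.g. "template.docx" becomes "template.pdf");
            //    the optional "x" means a ".doc" extension is rewritten as well
            String pdfPath = docxFile.getAbsolutePath().replaceAll("\\.docx?$", ".pdf");
            File pdfFile = new File(pdfPath);

            // 3. Open an output stream to the PDF file
            try (OutputStream os = new FileOutputStream(pdfFile)) {
                // 4. Run the conversion
                Docx4J.toPDF(wordMLPackage, os);
                os.flush();
            }

            // Message: "Successfully converted {} to {}"
            log.info("成功将 {} 转换为 {}", docxFile.getName(), pdfFile.getName());
            return pdfFile;

        } catch (Exception e) {
            // Message: "Error while converting DOCX to PDF"
            log.error("DOCX 转换为 PDF 时出错", e);
            // Message: "Failed to convert the document to PDF."
            throw new Exception("文档转换为PDF格式失败。", e);
        }
    }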
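
The method above rewrites both `.docx` and `.doc` file names, but the load step reads the input as a ZIP package (the exception message itself says "from ZipFile"). A `.docx` file is a ZIP archive, while a legacy binary `.doc` file is an OLE2 compound document, so checking the first bytes before loading can help tell a wrong input format apart from other failures. The following is only a sketch of such a check: the class `WordFormatSniffer`, its `Kind` enum, and the `sniff` methods are hypothetical names, not part of docx4j.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper (not a docx4j API): guesses the container format from the file signature.
public class WordFormatSniffer {

    public enum Kind { OOXML_ZIP, LEGACY_OLE2, UNKNOWN }

    // OLE2 compound file signature, used by legacy binary .doc files.
    private static final byte[] OLE2 = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // Classifies a file by its leading bytes.
    public static Kind sniff(byte[] head) {
        // ZIP local file header signature: 'P' 'K' 0x03 0x04
        if (head.length >= 4 && head[0] == 'P' && head[1] == 'K' && head[2] == 3 && head[3] == 4) {
            return Kind.OOXML_ZIP;
        }
        if (head.length >= OLE2.length && Arrays.equals(Arrays.copyOf(head, OLE2.length), OLE2)) {
            return Kind.LEGACY_OLE2;
        }
        return Kind.UNKNOWN;
    }

    // Reads at most the first 8 bytes of the file and classifies them.
    public static Kind sniff(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            byte[] head = new byte[OLE2.length];
            int n = in.readNBytes(head, 0, head.length); // available since Java 9
            return sniff(Arrays.copyOf(head, n));
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length == 1) {
            System.out.println(args[0] + " -> " + sniff(Paths.get(args[0])));
        }
    }
}
```

If the check reports `LEGACY_OLE2`, the input is a binary `.doc` file rather than a ZIP-based `.docx`, and renaming the output path alone does not change what the loader has to parse.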

Exception output

org.docx4j.openpackaging.exceptions.Docx4JException: Couldn't get [Content_Types].xml from ZipFile
	at org.docx4j.openpackaging.io3.Load3.get(Load3.java:148)
	at org.docx4j.openpackaging.packages.OpcPackage.load(OpcPackage.java:572)
	at org.docx4j.openpackaging.packages.OpcPackage.load(OpcPackage.java:421)
	at org.docx4j.openpackaging.packages.OpcPackage.load(OpcPackage.java:298)
	at org.docx4j.openpackaging.packages.OpcPackage.load(OpcPackage.java:276)
	at org.docx4j.openpackaging.packages.WordprocessingMLPackage.load(WordprocessingMLPackage.java:170)
	at com.ruoyi.system.controller.SignInfoController.convertDocxToPdf(SignInfoController.java:892)
	at com.ruoyi.system.util.DocToPdfUtils.main(DocToPdfUtils.java:41)
Caused by: org.docx4j.openpackaging.exceptions.InvalidFormatException: Bad [Content_Types].xml
	at org.docx4j.openpackaging.contenttype.ContentTypeManager.parseContentTypesFile(ContentTypeManager.java:889)
	at org.docx4j.openpackaging.io3.Load3.get(Load3.java:146)
	... 7 common frames omitted
Caused by: java.lang.RuntimeException: javax.xml.bind.JAXBException: JAXB: Can't instantiate JAXB Reference Implementation
 - with linked exception:
[java.lang.ClassNotFoundException: org.docx4j.jaxb.ri.NamespacePrefixMapper]
	at org.docx4j.XmlUtils.marshaltoString(XmlUtils.java:941)
	at org.docx4j.XmlUtils.marshaltoString(XmlUtils.java:857)
	at org.docx4j.openpackaging.contenttype.ContentTypeManager.parseContentTypesFile(ContentTypeManager.java:869)
	... 8 common frames omitted
Caused by: javax.xml.bind.JAXBException: JAXB: Can't instantiate JAXB Reference Implementation
	at org.docx4j.jaxb.NamespacePrefixMapperUtils.tryUsingRI(NamespacePrefixMapperUtils.java:95)
	at org.docx4j.jaxb.NamespacePrefixMapperUtils.getPrefixMapper(NamespacePrefixMapperUtils.java:71)
	at org.docx4j.XmlUtils.marshaltoString(XmlUtils.java:889)
	... 10 common frames omitted
Caused by: java.lang.ClassNotFoundException: org.docx4j.jaxb.ri.NamespacePrefixMapper
	at java.base/jdk.internal.loader.BuiltinClassLoader.loadClass(BuiltinClassLoader.java:581)
	at java.base/jdk.internal.loader.ClassLoaders$AppClassLoader.loadClass(ClassLoaders.java:178)
	at java.base/java.lang.ClassLoader.loadClass(ClassLoader.java:527)
	at java.base/java.lang.Class.forName0(Native Method)
	at java.base/java.lang.Class.forName(Class.java:315)
	at org.docx4j.jaxb.NamespacePrefixMapperUtils.tryUsingRI(NamespacePrefixMapperUtils.java:79)
	... 12 common frames omitted
Exception in thread "main" java.lang.Exception: 文档转换为PDF格式失败。
	at com.ruoyi.system.controller.SignInfoController.convertDocxToPdf(SignInfoController.java:911)
	at com.ruoyi.system.util.DocToPdfUtils.main(DocToPdfUtils.java:41)
Caused by: org.docx4j.openpackaging.exceptions.Docx4JException: Couldn't get [Content_Types].xml from ZipFile
	at org.docx4j.openpackaging.io3.Load3.get(Load3.java:148)
	at org.docx4j.openpackaging.packages.OpcPackage.load(OpcPackage.java:572)
	at org.docx4j.openpackaging.packages.OpcPackage.load(OpcPackage.java:421)
	at org.docx4j.openpackaging.packages.OpcPackage.load(OpcPackage.java:298)
	at org.docx4j.openpackaging.packages.OpcPackage.load(OpcPackage.java:276)
	at org.docx4j.openpackaging.packages.WordprocessingMLPackage.load(WordprocessingMLPackage.java:170)
	at com.ruoyi.system.controller.SignInfoController.convertDocxToPdf(SignInfoController.java:892)
	... 1 more
Caused by: org.docx4j.openpackaging.exceptions.InvalidFormatException: Bad [Content_Types].xml
	at org.docx4j.openpackaging.contenttype.ContentTypeManager.parseContentTypesFile(ContentTypeManager.java:889)
	at org.docx4j.openpackaging.io3.Load3.get(Load3.java:146)
	... 7 more
Caused by: java.lang.RuntimeException: javax.xml.bind.JAXBException: JAXB: Can't instantiate JAXB Reference Implementation
 - with linked exception:
[java.lang.ClassNotFoundException: org.docx4j.jaxb.ri.NamespacePrefixMapper]
	at org.docx4j.XmlUtils.marshaltoString(XmlUtils.java:941)
	at org.docx4j.XmlUtils.marshaltoString(XmlUtils.java:857)
	at org.docx4j.openpackaging.contenttype.ContentTypeManager.parseContentTypesFile(ContentTypeManager.java:869)
	... 8 more
Caused by: javax.xml.bind.JAXBException: JAXB: Can't instantiate JAXB Reference Implementation
 - with linked exception:
[java.lang.ClassNotFoundException: org.docx4j.jaxb.ri.NamespacePrefixMapper]
	at org.docx4j.jaxb.NamespacePrefixMapperUtils.tryUsingRI(NamespacePrefixMapperUtils.java:95)
	at org.docx4j.jaxb.NamespacePrefixMapperUtils.getPrefixMapper(NamespacePrefixMapperUtils.java:71)
	at org.docx4j.XmlUtils.marshaltoString(XmlUtils.java:889)
	... 10 more
Caused by: java.lang.ClassNotFoundException: org.docx4j.jaxb.ri.NamespacePrefixMapper
	at java.base/jdk.internal.loader.BuiltinClassLoader.loadClass(BuiltinClassLoader.java:581)
	at java.base/jdk.internal.loader.ClassLoaders$AppClassLoader.loadClass(ClassLoaders.java:178)
	at java.base/java.lang.ClassLoader.loadClass(ClassLoader.java:527)
	at java.base/java.lang.Class.forName0(Native Method)
	at java.base/java.lang.Class.forName(Class.java:315)
	at org.docx4j.jaxb.NamespacePrefixMapperUtils.tryUsingRI(NamespacePrefixMapperUtils.java:79)
	... 12 more
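
Reading the trace from the innermost cause outward, the root failure is `ClassNotFoundException: org.docx4j.jaxb.ri.NamespacePrefixMapper`, raised from `NamespacePrefixMapperUtils.tryUsingRI`. That suggests the "Bad [Content_Types].xml" message may be a symptom of a JAXB-related class missing from the classpath, not of a damaged document. The `java.base` frames also show the code running on Java 9 or later, and the JAXB module was removed from the JDK in Java 11. A minimal sketch for checking which classes are actually loadable at runtime follows; `JaxbClasspathProbe` and `isPresent` are hypothetical names introduced for illustration.

```java
// Hypothetical diagnostic (not part of docx4j): reports whether given classes can be loaded.
public class JaxbClasspathProbe {

    // Returns true if the class can be found by this class's class loader, without initializing it.
    public static boolean isPresent(String className) {
        try {
            Class.forName(className, false, JaxbClasspathProbe.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String[] candidates = {
            "org.docx4j.jaxb.ri.NamespacePrefixMapper", // the class named in the trace above
            "javax.xml.bind.JAXBContext"                // the JAXB API entry point
        };
        for (String name : candidates) {
            System.out.println(name + " -> " + (isPresent(name) ? "found" : "MISSING"));
        }
    }
}
```

Run it with the same classpath as the application. If the first entry prints MISSING, one thing to try is adding a docx4j JAXB binding artifact next to `docx4j-core`. To my knowledge the 8.x line publishes `org.docx4j:docx4j-JAXB-ReferenceImpl` for this purpose, but treat that coordinate and the matching version as an assumption to verify against the docx4j documentation.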
Originally published 2025-07-21 on the author's personal site/blog.