kkFileView: Optimizing PDF Image Preview and Adding JPEG 2000 Image Support

Published: 2023-06-13 09:16:44

Blank pages when kkFileView previews certain PDF files

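# Preface

While using kkFileView in our project, we received reports that some PDFs showed no content in preview: every page was a blank image. The official issue tracker has many similar reports, but none offers a thorough, practical fix.

Simply adding dependencies that handle the special image format slows down page loading, because image conversion is time-consuming. So we also need to make the conversion asynchronous and use multiple threads to speed it up.

The rest of this article describes a better way to handle this PDF image decoding problem.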

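# Project changes

Adding dependencies. A PDF shows no content in image-mode preview when it contains JPEG 2000 images, because kkFileView does not ship a decoder dependency for this image type. We therefore add the relevant dependencies to pom.xml; this is largely the same as the approach you will find online.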

<dependency>
   <groupId>com.github.jai-imageio</groupId>
   <artifactId>jai-imageio-core</artifactId>
   <version>1.3.1</version>
</dependency>
<dependency>
   <groupId>org.apache.pdfbox</groupId>
   <artifactId>jbig2-imageio</artifactId>
   <version>3.0.2</version>
</dependency>
<dependency>
   <groupId>com.github.jai-imageio</groupId>
   <artifactId>jai-imageio-jpeg2000</artifactId>
   <version>1.3.0</version>
</dependency>
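
To confirm the new jars are actually picked up, one option is to ask `ImageIO` which readers it has registered. The following is a minimal sketch, not part of kkFileView; the class name `ImageReaderCheck` is hypothetical, and the format names `jpeg2000` and `jbig2` are assumed to be the names under which these plugins register themselves.

```java
import javax.imageio.ImageIO;

/** Hypothetical helper: reports which ImageIO readers are available on the classpath. */
final class ImageReaderCheck {

    /** Returns true if ImageIO has at least one registered reader for the format name. */
    static boolean hasReader(String formatName) {
        return ImageIO.getImageReadersByFormatName(formatName).hasNext();
    }

    public static void main(String[] args) {
        // Re-scan the classpath so newly added plugin jars are registered
        ImageIO.scanForPlugins();
        // "jpeg2000" and "jbig2" are assumed plugin format names; "jpeg" ships with the JDK
        for (String name : new String[] {"jpeg", "jpeg2000", "jbig2"}) {
            System.out.println(name + " reader available: " + hasReader(name));
        }
    }
}
```

If the JPEG 2000 line reports `false` when run with the application's classpath, the dependency is probably missing from the packaged build.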

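Optimizing the image conversion code. When a conversion request arrives, the work is split by page and handed to the threads of a pool, so how it is spread depends on the thread count and the total number of pages in the PDF. Pages that have already been converted are not converted again.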

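/**
 * Converts a PDF file into a set of JPG images, one per page.
 * @param pdfFilePath path of the PDF file
 * @param pdfName name of the PDF file
 * @param baseUrl base access URL
 * @return list of image URLs, one per page
 */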
public List<String> pdf2jpg(String pdfFilePath, String pdfName, String baseUrl) {
   List<String> imageUrls = new ArrayList<>();
   Integer imageCount = this.getConvertedPdfImage(pdfFilePath);
   String imageFileSuffix = ".jpg";
   String pdfFolder = pdfName.substring(0, pdfName.length() - 4);
   String urlPrefix;
   try {
      urlPrefix = baseUrl + URLEncoder.encode(pdfFolder, uriEncoding).replaceAll("\\+", "%20");
   } catch (UnsupportedEncodingException e) {
      logger.error("UnsupportedEncodingException", e);
      urlPrefix = baseUrl + pdfFolder;
   }

   // If this PDF is already cached (or a conversion is in progress), return the URLs directly
   try {
      PDDocument doc = PDDocument.load(new File(pdfFilePath));
      PDFRenderer pdfRendererMulti = new PDFRenderer(doc);
      int pageCount = doc.getNumberOfPages();
      int index = pdfFilePath.lastIndexOf(".");
      String folder = pdfFilePath.substring(0, index);
      for (int i = 0; i < pageCount; i++) {
            imageUrls.add(urlPrefix + "/" + i + imageFileSuffix);
      }
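      Integer pdf2jpgLock = this.getConvertedPdfImage(pdfFilePath.concat("_LOCK"));
      if ((pdf2jpgLock != null && pdf2jpgLock > 0) || (imageCount != null && imageCount > 0)) {
            // Nothing to render here, so close the document before returning
            try {
               doc.close();
            } catch (IOException e) {
               logger.error("doc close error", e);
            }
            return imageUrls;
      }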
      File path = new File(folder);
      if (!path.exists() && !path.mkdirs()) {
            logger.error("Failed to create conversion directory [{}]; please check the directory permissions!", folder);
      }

      CompletableFuture.runAsync(() -> {
            try {
               List<CompletableFuture<String>> futures = new ArrayList<>();
               for (int i = 0; i < pageCount; i++) {
                  int finalI = i;
                  CompletableFuture<String> future = CompletableFuture.supplyAsync(() ->
                           this.pdf2jpg(pdfRendererMulti, pdfFilePath, pdfName, baseUrl, finalI), commonThreadPool);
                  futures.add(future);
               }
               // Wait for every page task, then record the page count as converted
               CompletableFuture.allOf(futures.toArray(new CompletableFuture[0])).join();
               this.addConvertedPdfImage(pdfFilePath, pageCount);
            } finally {
               // Release the lock and close the document even if a page task fails
               this.addConvertedPdfImage(pdfFilePath.concat("_LOCK"), 0);
               try {
                  doc.close();
                  logger.info("doc close");
               } catch (IOException e) {
                  logger.error("doc close error", e);
               }
            }
      }, commonThreadPool);
   } catch (Exception e) {
      this.addConvertedPdfImage(pdfFilePath.concat("_LOCK"), 0);
      logger.error("Convert pdf to jpg exception, pdfFilePath：{}", pdfFilePath, e);
   }
   return imageUrls;
}
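
The method above reads the `_LOCK` entry first and only writes it later, inside the per-page task. Two requests that arrive at almost the same moment could, in principle, both pass the check and both start converting. One way to close that window is an atomic check-and-set. The sketch below is an illustration only: `ConversionGuard` and its methods are hypothetical names, and it keeps state in memory rather than in kkFileView's cache.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

/** Hypothetical in-memory guard that lets only one caller start converting a given PDF. */
final class ConversionGuard {
    private final ConcurrentMap<String, Boolean> inProgress = new ConcurrentHashMap<>();

    /** Atomically claims the conversion; returns true only for the first caller. */
    boolean tryStart(String pdfFilePath) {
        return inProgress.putIfAbsent(pdfFilePath, Boolean.TRUE) == null;
    }

    /** Releases the claim so a later request may convert again, for example after a failure. */
    void finish(String pdfFilePath) {
        inProgress.remove(pdfFilePath);
    }
}
```

A caller would invoke `tryStart` before scheduling the page tasks and `finish` in a `finally` block once they complete. A deployment with several kkFileView instances would need a shared store instead of a local map.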

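/**
 * Converts a single page of a PDF file into a JPG image.
 * @param pdfRendererMulti renderer shared by all page tasks of this PDF
 * @param pdfFilePath path of the PDF file
 * @param pdfName name of the PDF file
 * @param baseUrl base access URL
 * @param pageIndex index of the page to convert (0-based)
 * @return URL of the image for this page
 */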
public String pdf2jpg(PDFRenderer pdfRendererMulti, String pdfFilePath, String pdfName, String baseUrl, int pageIndex) {
   logger.info("current thread {}, currentIndex:{}", Thread.currentThread().getName(), pageIndex);
   this.addConvertedPdfImage(pdfFilePath.concat("_LOCK"), pageIndex + 1);
   String imageFileSuffixMulti = ".jpg";
   String pdfFolder = pdfName.substring(0, pdfName.length() - 4);
   String urlPrefix;
   try {
      urlPrefix = baseUrl + URLEncoder.encode(pdfFolder, uriEncoding).replaceAll("\\+", "%20");
   } catch (UnsupportedEncodingException e) {
      logger.error("UnsupportedEncodingException", e);
      urlPrefix = baseUrl + pdfFolder;
   }

   String imageUrl = urlPrefix + "/" + pageIndex + imageFileSuffixMulti;
   // Resolve where the image lives on disk; the existence check must use this path, not the URL
   int index = pdfFilePath.lastIndexOf(".");
   String folder = pdfFilePath.substring(0, index);
   String imageFilePath = folder + File.separator + pageIndex + imageFileSuffixMulti;
   // If the image file already exists, return its URL directly
   if (new File(imageFilePath).exists()) {
      logger.info("{} already exists", imageFilePath);
      return imageUrl;
   }
   // The image does not exist yet, so render this page
   try {
      BufferedImage imageResource = pdfRendererMulti.renderImageWithDPI(pageIndex, 105, ImageType.RGB);
      ImageIOUtil.writeImage(imageResource, imageFilePath, 105);
      // Release the resources held by the rendered image
      imageResource.flush();
   } catch (IOException e) {
      this.addConvertedPdfImage(pdfFilePath.concat("_LOCK"), 0);
      logger.error("Convert pdf to jpg exception, pdfFilePath：{}", pdfFilePath, e);
   }
   return imageUrl;
}
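
The one-task-per-page pattern used above can be tried in isolation, without PDFBox. The sketch below is a stand-alone illustration under stated assumptions: `PageFanOutDemo` is a hypothetical class, a `Set` stands in for the image files on disk, and the four-thread fixed pool is an assumed stand-in for `commonThreadPool`, whose definition the article does not show.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

/** Hypothetical stand-alone demo of the per-page fan-out; no PDF rendering is involved. */
final class PageFanOutDemo {
    // Pages already "converted"; stands in for the image files on disk
    private final Set<Integer> converted = ConcurrentHashMap.newKeySet();

    /** Pretends to convert one page; returns false if the page was already done. */
    boolean convertPage(int pageIndex) {
        return converted.add(pageIndex);
    }

    /** Submits one task per page, waits for all of them, and returns how many pages were newly converted. */
    int convertAll(int pageCount, ExecutorService pool) {
        List<CompletableFuture<Boolean>> futures = new ArrayList<>();
        for (int i = 0; i < pageCount; i++) {
            final int pageIndex = i;
            futures.add(CompletableFuture.supplyAsync(() -> convertPage(pageIndex), pool));
        }
        CompletableFuture.allOf(futures.toArray(new CompletableFuture[0])).join();
        int newlyConverted = 0;
        for (CompletableFuture<Boolean> f : futures) {
            if (f.join()) {
                newlyConverted++;
            }
        }
        return newlyConverted;
    }

    public static void main(String[] args) {
        ExecutorService pool = Executors.newFixedThreadPool(4); // assumed pool size
        try {
            PageFanOutDemo demo = new PageFanOutDemo();
            System.out.println(demo.convertAll(10, pool)); // 10: every page is new
            System.out.println(demo.convertAll(10, pool)); // 0: all pages are skipped
        } finally {
            pool.shutdown();
        }
    }
}
```

Here the waiting happens on the calling thread. If the coordinating task itself runs on the same bounded pool as the page tasks, as in the article's code, the pool probably needs more than one thread so the coordinator does not occupy the only worker while it waits.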

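Optimizing image loading on the page. An image that is still being converted cannot be displayed, so when loading fails the page waits a while and requests the image again, repeating until the conversion has finished and the image loads successfully.

<!-- Add an error handler to the image tag -->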
<div class="img-area">
  <img
    class="my-photo"
    alt="loading"
    onerror="imgError(this)"
    data-src="${img}"
    src="images/loading.gif"
  />
</div>

Handling the image load error:

// On a load error, show the loading animation, then retry the original image after 6 seconds
function imgError(img) {
   img.setAttribute("src", "images/loading.gif");
   let t = setTimeout(function () {
      img.setAttribute("src", $(img).data('src'));
      clearTimeout(t);
   }, 6000);
}

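# Recommendations

This article only offers one way to approach the change. In real use it slightly slows down PDF preview, because image decoding takes time, and PDFs that previously rendered fine will also open a little more slowly. If you genuinely need to preview this kind of special PDF, you can use it as a reference.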
