小编Kar*_*san的帖子

如何使用apachePOI从Word文档(.doc或.docx)中读取形状组作为图像?

我有一个简单的要求,提取在MS Word文件中绘制的所有图像和图表.我只能提取图像而不能提取形状组(如用例图或活动图).我想将所有Diagrams保存为图像.

我用过apachePOI.

以下代码我写了

public class worddocreader {
public static void main(String args[]) {
    FileInputStream fis;
    try {
        FileInputStream fs = new FileInputStream("F:/1.docx");
        XWPFDocument docx = new XWPFDocument(fs);
        List<XWPFPictureData> piclist = docx.getAllPictures();
        Iterator<XWPFPictureData> iterator = piclist.iterator();
        int i = 0;
        while (iterator.hasNext()) {
            XWPFPictureData pic = iterator.next();
            byte[] bytepic = pic.getData();
            BufferedImage imag = ImageIO.read(new ByteArrayInputStream(
                    bytepic));
            ImageIO.write(imag, "image/jpeg", new File("F:/docParsing/imagefromword" + i + ".jpg"));
            i++;
        }

        ArrayList<PackagePart> packArrayList = docx.getPackageRelationship().getPackage().getParts();
        int size = packArrayList.size();
        System.out.println("Array List Size : " …
Run Code Online (Sandbox Code Playgroud)

java hwpf apache-poi xwpf

6
推荐指数
1
解决办法
1675
查看次数

标签 统计

apache-poi ×1

hwpf ×1

java ×1

xwpf ×1