How do I extract images from a docx (word) file?

93 次查看(过去 30 天)
I am trying to extract images for processing from a docx (Word) file. How do I do this?

回答(2 个)

Stephen23
Stephen23 2019-5-21
编辑:Stephen23 2019-5-21
All OpenOffice XML formats (e.g. .docx, .xlsx, etc.) constitute XML files and supporting files zipped together into one file. You can simply access the image files by:
  1. Unzip the .docx into a folder (e.g. with the same name as the document).
  2. Open that folder, browse to the subfolder word\media to find all of the image files.
Use a reliable tool for unzipping, e.g. 7-zip, in which case the first step is trivial with a right-click on the document in windows explorer -> select "7-zip" -> and then choosing from its menu.
  1 个评论
Walter Roberson
Walter Roberson 2019-5-21
It is also possible to use Java methods to pull an entry from a zip archive without unzipping the whole thing. Jan posted some code about a week ago that used that interface but it might have been in a resurrected older question.

请先登录,再进行评论。


Gagan Bhangu
Gagan Bhangu 2020-7-29
Yes, you can easily extract all images from word document.
Method 1: Save your docx file as web page.
Open docx file in Microsoft Word and click on the File > Save as > and select the web page (.html) option from the save as type.
After that, you’ll see the HTML file and folder with images where you saved it on your PC.
Also, you can use Google Docs. In Docs, click on the file > Download and select save as web page.
Method 2: By Right Click on Image
You can Save image one by one fromt the MS Word by right click on the image. Select Save as picture option for this.
Method 3: Open .docx file as a .zip
Change file extension from .docx to .zip and open the zip file. Extract media folder.

类别

Help CenterFile Exchange 中查找有关 Read, Write, and Modify Image 的更多信息

产品


版本

R2019a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by