無Word安裝的通用Microsoft Word文檔解析器

爲了解析：Microsoft Word 97/2003（.doc）&使用C＃和WPF而不安裝Word的Microsoft Word 2007/2010（.docx），我需要知道某人是否可以給我一個認真的圖書館，以實現這一目標。無Word安裝的通用Microsoft Word文檔解析器

技術上我遍歷throught ZipEntry的元素這樣的：

foreach (string file in _listPathFiles) 
     { 
      using (Ionic.Zip.ZipFile zip = ZipFile.Read(file)) 
      { 
       try 
       { 
        zip.ToList().ForEach(entry => 
        { 
         if (entry.FileName.EndsWith(".doc") || 
          entry.FileName.EndsWith(".docx")) 
         { 
          // Extract file into disk 
          entry.FileName = System.IO.Path.GetFileName(entry.FileName); 
          entry.Extract(baseStoragePath); 

          // Get data from file with Parser 
          string filePath = baseStoragePath + entry.FileName; 


          // Remove extracted filess 
          if (File.Exists(filePath)) 
          { 
           File.Delete(filePath); 
           Console.WriteLine("Delete : " + filePath); 
          } 
         } 
        }); 
       } 
       catch (Exception e) 
       { 
        Console.WriteLine("Fail to unzip Exception : " + e.StackTrace); 
       } 
      } 
     }

我不知道，我可以使用的ZipEntry直接獲取文檔，可能是我將不得不解析之前解壓呢？！

而且我的目標是獲取位於「標題1」Microsoft Word樣式之後的數據，因此該庫應該能夠獲得此類屬性。

圖書館的想法和代碼示例，歡迎..

來源

2014-02-18 eento

NPOI如何？ http://npoi.codeplex.com/ –

-1

退房NPOI（Apache的NOI API的.NET端口）： http://npoi.codeplex.com/

或

下載OpenXML SDK閱讀Office文檔如MS Word。

來源

2014-02-18 09:07:33

查看文檔的NPOI似乎無法管理.doc文件和OpenXML SDK，它的意思是（.docx）也不能與.doc一起工作。 – eento

無Word安裝的通用Microsoft Word文檔解析器

回答

相關問題