2017-08-07 173 views
0

我在寫一個使用OpenXML讀取excel文件的程序。該文件還包含日期,所以我需要一個識別它們的函數。這裏在stackoverflow我發現這個解決方案:How to distinguish inline numbers from OLE Automation date numbers in OpenXML SpreadSheet?NumberingFormats始終爲空,爲什麼?

我tryed將其應用到我的代碼,但我每次碰到下面的錯誤時間:

「類型‘System.NullReferenceException’未處理的異常發生在OpenXML_Prova.exe

更多信息: 你調用的對象是空的。」

在GetDateTimeCellFormats的第一行,其中似乎NumberingFormats始終爲空。

下面是完整的代碼:

using DocumentFormat.OpenXml; 
using DocumentFormat.OpenXml.Packaging; 
using DocumentFormat.OpenXml.Spreadsheet; 
using System; 
using System.Collections.Generic; 
using System.IO; 
using System.Linq; 
using System.Text; 
using System.Text.RegularExpressions; 
using System.Threading.Tasks; 



namespace OpenXML_Prova 
{ 
    class Program 
    { 

     DocumentFormat.OpenXml.Spreadsheet.NumberingFormats numb = new NumberingFormats(); 

     static uint[] builtInDateTimeNumberFormatIDs = new uint[] { 14, 15, 16, 17, 18, 19, 20, 21, 22, /*27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 45, 46, 47, 50, 51, 52, 53, 54, 55, 56, 57, 58 */}; 
     static Dictionary<uint, NumberingFormat> builtInDateTimeNumberFormats = builtInDateTimeNumberFormatIDs.ToDictionary(id => id, id => new NumberingFormat { NumberFormatId = id }); 
     static Regex dateTimeFormatRegex = new Regex(@"((?=([^[]*\[[^[\]]*\])*([^[]*[ymdhs]+[^\]]*))|.*\[(h|mm|ss)\].*)", RegexOptions.Compiled); 

     static void Main(string[] args) { 
      Program prova = new Program(); 
      prova.ReadFile(); 
     } 

     void ReadFile() 
     { 

      var filePath = @"C:\\Users\\m.p\\Desktop\\report_fatturato_brevissimo.xlsx"; 
      using (FileStream fs = new FileStream(filePath, FileMode.Open, FileAccess.Read, FileShare.ReadWrite)) 
      { 
       using (SpreadsheetDocument doc = SpreadsheetDocument.Open(fs, false)) 
       { 
        WorkbookPart workbookPart = doc.WorkbookPart; 
        SharedStringTablePart sstpart = workbookPart.GetPartsOfType<SharedStringTablePart>().First(); 
        SharedStringTable sst = sstpart.SharedStringTable; 

        WorksheetPart worksheetPart = workbookPart.WorksheetParts.First(); 
        Worksheet sheet = worksheetPart.Worksheet; 

        var cells = sheet.Descendants<Cell>(); 
        var rows = sheet.Descendants<Row>(); 

        Console.WriteLine("Row count = {0}", rows.LongCount()); 
        Console.WriteLine("Cell count = {0}", cells.LongCount()); 

        // Or... via each row 
        foreach (Row row in rows) 
        { 
         foreach (Cell c in row.Elements<Cell>()) 
         { 
          if ((c.DataType != null) && (c.DataType == CellValues.SharedString)) 
          { 
           int ssid = int.Parse(c.CellValue.Text); 
           string str = sst.ChildElements[ssid].InnerText; 
           Console.Write/*Line*/(/*"Shared string {0}: {1}", ssid, */str + "\t"); 
          } 
          else 
          { 
           if (IsDateTimeCell(workbookPart, c) == true) 
           { 
            int ssid = int.Parse(c.CellValue.Text); 
            DateTime date = DateTime.FromOADate(Double.Parse(c.CellValue.Text)); 
            Console.Write/*Line*/(/*"Shared string {0}: {1}", ssid, */date + "\t"); 
           } 
           else if (c.CellValue != null) 
           { 
            Console.Write/*Line*/(/*"Cell contents: {0}", */c.CellValue.Text + "\t"); 
           } 
          } 
         } 
         Console.WriteLine(""); 
        } 
       } 
      } 
      Console.Read(); 
     } 


     public static Dictionary<uint, NumberingFormat> GetDateTimeCellFormats(WorkbookPart wbPart) 
     { 

//Error on the following instruction: NumberingFormats seems to be always null 
      var dateNumberFormats = wbPart.WorkbookStylesPart.Stylesheet.NumberingFormats 
       .Descendants<NumberingFormat>() 
       .Where(nf => dateTimeFormatRegex.Match(nf.FormatCode.Value).Success) 
       .ToDictionary(nf => nf.NumberFormatId.Value); 

      var cellFormats = wbPart.WorkbookStylesPart.Stylesheet.CellFormats 
       .Descendants<CellFormat>(); 

      var dateCellFormats = new Dictionary<uint, NumberingFormat>(); 
      uint styleIndex = 0; 
      foreach (var cellFormat in cellFormats) 
      { 
       if (cellFormat.ApplyNumberFormat != null && cellFormat.ApplyNumberFormat.Value) 
       { 
        if (dateNumberFormats.ContainsKey(cellFormat.NumberFormatId.Value)) 
        { 
         dateCellFormats.Add(styleIndex, dateNumberFormats[cellFormat.NumberFormatId.Value]); 
        } 
        else if (builtInDateTimeNumberFormats.ContainsKey(cellFormat.NumberFormatId.Value)) 
        { 
         dateCellFormats.Add(styleIndex, builtInDateTimeNumberFormats[cellFormat.NumberFormatId.Value]); 
        } 
       } 

       styleIndex++; 
      } 

      return dateCellFormats; 
     } 

     //Usage Example 
     public static bool IsDateTimeCell(WorkbookPart wbPart, Cell cell) 
     { 
      if (cell.StyleIndex == null) 
       return false; 

      var dateTimeCellFormats = GetDateTimeCellFormats(wbPart); 

      return dateTimeCellFormats.ContainsKey(cell.StyleIndex); 
     } 


    } 
} 

回答

0

Excel用於在電子表格單元格,基本格式OpenXML has 164 built-in formats。默認的日期時間格式是這些內置類型的一部分。

僅當您使用Excel(自定義格式)創建新格式時,它們是否會包含在StylesheetNumberingFormats部分。

我建議你換你的代碼像下面檢查NumberingFormats,避免NullReferenceException

if (wbPart.WorkbookStylesPart.Stylesheet.NumberingFormats != null) {

然後,你將需要添加一些代碼來檢測的日期時間格式。 這個SO article tries來確定可能的日期時間樣式。查看一些評論和上面的第一個鏈接,以參考提供更多細節的規範。