我想使用Spring批處理基於屬性值解析xml,下面是XML供參考。使用Spring批量解析基於屬性值的xml片段
<?xml version="1.0" encoding="UTF-8"?>
<customerInfo>
<cutommer dept="IT">
<param value="Jane" name="first-name"/>
<param value="Doe" name="last-name"/>
<param value="17 Streets" name="address"/>
<param value="1234567" name="phone-number"/>
</customer>
<cutommer dept="ES">
<param value="Jane" name="first-name"/>
<param value="Doe" name="last-name"/>
<param value="17 Streets" name="address"/>
<param value="1234567" name="phone-number"/>
</customer>
</customerInfo>
基礎上上面的XML要來解析客戶標籤,其部門屬性附加傷害值是「IT」。任何幫助是appriciated
更新1:
@Configuration
@EnableBatchProcessing
public class ControllerInfoParser_Config extends DefaultBatchConfigurer {
@Autowired
private JobBuilderFactory jobs;
@Autowired
private StepBuilderFactory steps;
@Bean
public Job parseComponentInfoXML(Step parseComponentInfo,Step partitionStep, CustomJobExecutionerListener customJobExecutionerListener)
throws UnexpectedInputException, ParseException, Exception {
return jobs.get("parseComponentInfoXML").listener(customJobExecutionerListener).start(parseComponentInfo)
.next(partitionStep).build();
}
@Bean
public Step parseComponentInfo(ItemReader<Customer> oneDeptITItemReader) throws UnexpectedInputException, ParseException, Exception {
return steps.get("parseComponentInfo").<Customer, Customer> chunk(1)
.reader(componentInfoReader()).reader(oneDeptITItemReader).processor(componentInfoProcessor())
.writer(componentInfoWriter()).build();
}
@Bean
public ItemReader<Customer> componentInfoReader() throws UnexpectedInputException, ParseException, Exception {
//OneDeptITItemReader <Customer> reader1 = new OneDeptITItemReader<Customer>();
StaxEventItemReader<Customer> reader = new StaxEventItemReader<Customer>();
reader.setResource(new ClassPathResource("xml//customer.xml"));
reader.setFragmentRootElementName("customer");
Jaxb2Marshaller marshaller = new org.springframework.oxm.jaxb.Jaxb2Marshaller();
marshaller.setClassesToBeBound(Customer.class);
// marshaller.setSchema(new ClassPathResource("xml//company.xsd"));
reader.setUnmarshaller(marshaller);
return reader;
}
@Bean
public ItemReader<Customer> oneDeptITItemReader(ItemReader<Customer> ir) {
OneDeptITItemReader<Customer> odIR = new OneDeptITItemReader<Customer>();
odIR.setDelegate(ir);
return odIR;
}
@Bean
public ItemProcessor<Customer, Customer> componentInfoProcessor() {
return new CustomerProcessor();
}
@Bean
public ItemWriter<Object> componentInfoWriter() {
return new SqlWritter();
}
}
public class OneDeptITItemReader <T> implements ItemReader <Customer>{
ItemReader<Customer> delegate;
public ItemReader<Customer> getDelegate() {
return delegate;
}
public void setDelegate(ItemReader<Customer> delegate) {
this.delegate = delegate;
}
@Override
public Customer read() {
boolean read = true;
Customer item = null;
while(read) {
try {
item = delegate.read();
} catch (Exception e) {
// TODO Auto-generated catch block
e.printStackTrace();
read =false;
}
read = !"IT".equals(item.getDept());
}
return item;
}
}
不要專注於閱讀,但在過程階段:用自定義'ItemProcessor中<客戶,客戶>'返回null部門<>「IT」或返回對象本身,如果部門是等於「IT」 –
感謝Luca提供的建議,早些時候我考慮過這種方法,但是我的XML文件在15 MB左右會很大,並且它只包含一個dept屬性值爲「IT」的片段,剩下的數千個客戶片段將不必要的解析併到達ItemProcessor 。一旦我們得到IT部門的客戶片段以避免不必要的資源消耗,是否有辦法阻止進一步的批處理流程? –