performance - Running time / Memory issue while copying from excel to SQL tables using Talend -
i copying data excel sheet sql tables. @ time around 2000 rows distributed across 18 tables. problem job is taking of time. takes around 2.5 mins job.
other issue facing memory. tried copy around 250,000 rows , couldn't run job basic settings. have increase xms , xmx allocation.
how solve these issues?
you should start job tmssqlconnection (i think that's dbms you're using) , finish tmssqlcommit component , see if helps @ talend opening large amount of connections database rather pooling them.
increasing commit size speed bulk loads if fails commit lose entire commit.
as this, long have no race conditions , don't care in order tables inserted or updated parallelise whole job either tparallelize component or enabling multi thread executions in tab under job window.
sometimes memory usage in job can improved splitting process down separate jobs , linking them child jobs in 1 large wrapper parent job trunjob components. make job more manageable.
finally, there's couple of options in advanced settings of each database output component allows increase batch size (although increase memory usage) , enable parallel connections can improve performance utilising more database server cores.
your memory issues unlikely resolved short of re-engineering job deal smaller chunks of data @ time , commit each part , grab next lot.
this done using tfilterrow component , selecting first x records (by filter condition, if data set has none add 1 first preprocessing give every row numeric.sequence
), processing , putting in table , picking next x records , on.
Comments
Post a Comment