Changes for page Volume Mapping (On-premise)
Last modified by Erik Bakker on 2024/08/26 12:37
From version 34.1
edited by Erik Bakker
on 2022/06/10 13:46
Change comment:
There is no comment for this version
To version 30.2
edited by Erik Bakker
on 2022/06/10 13:23
Change comment:
Update document after refactoring.
Summary
Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
- Title
... ... @@ -1,1 +1,1 @@
1 -Archiving
1 +novice-file-based-connectivity-characterset
- Content
... ... @@ -1,10 +1,10 @@
1 1 {{container}}{{container layoutStyle="columns"}}(((
2 -In most cases, the customer wants some kind of insurance policy to determine whether a file has entered or left eMagiz. One way of supplying such functionality for file-based data exchange is through the use of archiving functionality. With the help of this functionality, you can easily write the file as received (or about to be sent) to a separate location. By ensuring that the data stays in that location for a certain amount of time and by giving the customer and yourself access, you have a sort of audit trail that details the messages that have been exchanged. This archive could also be used to analyze problems in case things go wrong. In this microlearning, we will learn how you can set up that archiving and learn how to clean up the archiving to ensure that data is only kept for a limited period.
2 +In some cases, you want to treat each unique part of your input file as its own message instead of processing the complete file as a single message. In this microlearning, we will learn how you can process a (large) file on a per-line basis.
3 3
4 4 Should you have any questions, please contact [[academy@emagiz.com>>mailto:academy@emagiz.com]].
5 5
6 6 * Last update: May 31st, 2021
7 -* Required reading time: 6 minutes
7 +* Required reading time: 7 minutes
8 8
9 9 == 1. Prerequisites ==
10 10
... ... @@ -12,81 +12,93 @@
12 12
13 13 == 2. Key concepts ==
14 14
15 -This microlearning centers around learning how to archive correctly.
15 +This microlearning centers around learning how to process an incoming file per line.
16 16
17 -By archiving, we mean: Temporarily storing data for audit purposes and possible retry scenarios.
17 +By processing per line, we mean: Splitting up the input into discernible pieces that each become a unique message.
18 18
19 -* Archiving is used for audit purposes
20 -* Archiving is used for retry scenarios
21 -* Ensure that data is cleaned after a retention period to keep in control of the data
19 +* Easy way of reading a file line by line and sending it to eMagiz (Low on memory)
20 +* Ability to process each line based on distinctive logic that is relevant on line level
21 +* Can be used for flat file as well as XML input files
22 22
23 -== 3. Archiving ==
23 +== 3. Processing a File per Line ==
24 24
25 -In most cases, the customer wants some kind of insurance policy to determine whether a file has entered or left eMagiz. One way of supplying such functionality for file-based data exchange is through the use of archiving functionality. With the help of this functionality, you can easily write the file as received (or about to be sent) to a separate location. By ensuring that the data stays in that location for a certain amount of time and by giving the customer and yourself access, you have a sort of audit trail that details the messages that have been exchanged. This archive could also be used to analyze problems in case things go wrong. In this microlearning, we will learn how you can set up that archiving and learn how to clean up the archiving to ensure that data is only kept for a limited period.
25 +In some cases, you want to treat each unique part of your input file as its own message instead of processing the complete file as a single message. In this microlearning, we will learn how you can process a (large) file on a per-line basis.
26 26
27 -=== 3.1 Archiving itself ===
27 +To make this work in eMagiz you need to navigate to the Create phase of eMagiz and open the entry flow in which you want to retrieve the file from a certain location. Within the context of this flow, we need to add functionality that will ensure that each line is read and processed separately and becomes its own unique message. To do so, first enter "Start Editing" mode on flow level. After you have done so, add a file item reader message source to the flow. We will use this component to read and process our input file on a per-line basis.
28 28
29 -To make this work in eMagiz you need to navigate to the Create phase of eMagiz and open the entry flow in which you want to archive the files. Within the context of this flow, we need to add functionality that will ensure that each input file is archived and cleaned up when older than three days. To do so first enter "Start Editing" mode on flow level. The first decision we have to take is how we are going to name the files within the archiving. The best practice, in this case, is the original filename + the current time as a suffix. You can define this by dragging a format filename generator (support object) to the canvas.
29 +The first step is to define the directory from which we read our messages. As always, reference the directory with the help of a property.
30 30
31 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-archiving--file-name-generator.png]]
31 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--file-item-reader-directory.png]]
32 32
33 -After we have done this please add a file outbound channel adapter to the flow including an input channel. Ensure that you use a property for the directory that references another directory compared to the input directory to prevent creating an infinite loop.
33 +Secondly, just as when reading the file as a whole, ensure that you use a filter to retrieve only the correct files from the directory.
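To make the directory and filter steps above more concrete, here is a minimal Python sketch. It is not eMagiz configuration. It assumes a hypothetical property called `INPUT_DIRECTORY` (an environment variable stands in for a flow property) and a hypothetical filename pattern `*.csv`; neither name comes from eMagiz.

```python
import fnmatch
import os
from pathlib import Path


def resolve_directory(property_name="INPUT_DIRECTORY", default="."):
    """Look up the input directory via a property instead of hard-coding it.

    The property name is hypothetical; an environment variable stands in
    for a flow property in this sketch.
    """
    return Path(os.environ.get(property_name, default))


def select_files(directory, pattern="*.csv"):
    """Return only the regular files in `directory` whose name matches `pattern`."""
    return sorted(
        entry
        for entry in Path(directory).iterdir()
        if entry.is_file() and fnmatch.fnmatch(entry.name, pattern)
    )
```

Keeping the directory behind a property means the location can differ per environment without changing the logic, and the filter keeps unrelated files in the same directory from being picked up.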
34 34
35 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-archiving--archiving-config-file-outbound-basic.png]]
35 +=== 3.1 Item reader Type ===
36 36
37 -Now that we have configured the basics let us turn our attention to the advanced configuration. In the advanced tab of this component, we need to select the filename generator to ensure that the files are named correctly. In case you process each line separately you have to choose whether to save them as separate files in the archive or by appending them again. This can be achieved by selecting the correct Mode. In most cases, however, the default Mode of Replace will suffice.
37 +Now it is time to select our Item reader Type. As the help text of the eMagiz component suggests, there are two choices with this component. The first (and most frequently used) option is the Flat file item reader. With this option, you can read each line within the flat file input and output it as a separate message. The second option is called the Stax event item reader. With this option, you can read your input XML and output messages on a per-record basis.
38 38
39 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-archiving--archiving-config-file-outbound-advanced.png]]
39 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--item-reader-type-options.png]]
40 40
41 -The moment you are satisfied press Save. Now that we have configured this it becomes time to determine how we get the needed input to write to our archive. In the example we are using here we want to archive our input file so we need to ensure that the data we received is written to the archive as soon as possible. To do so place a wiretap on the first channel after retrieving the file. This will make sure that the message is archived before processed further. The result should be something as shown below. Note that this same piece of logic could be applied in other flows within the eMagiz platform in a similar manner.
41 +Based on your choice the exact configuration will differ.
42 42
43 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-archiving--archiving-result.png]]
43 +==== 3.1.1 Stax Event Item Reader ====
44 44
45 -=== 3.2 Clean up the Archive ===
45 +For the Stax event item reader, you need to define the name of the element on which you want to split the XML and define whether you want to throw an error in case no such element exists in the input file (by (de)selecting the option Strict). The default setting of eMagiz is advisable for this option.
46 46
47 -To ensure that the data is not kept indefinitely we need to clean up the archive. We do so to prevent problems with disk space but also to prevent data leaks of old data that could impact the privacy of others. Before we can set up the logic in eMagiz we need to talk to the customer to see what an acceptable term is within which the data is kept. In most cases, this is a week or two weeks. In this example, we have chosen three days.
47 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--stax-event-item-reader-config.png]]
48 48
49 -Now that we know the limit it is time to configure the components. We start with a composite file filter (support object). Within this filter, we at least define how old a file must be before it can be deleted (in milliseconds). If we turn three days into milliseconds we get 259200000. Furthermore, we at least define that we only want to delete regular files.
49 +==== 3.1.2 Flat File Item Reader ====
50 50
51 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-archiving--file-list-filter-for-archive-cleanup.png]]
51 +For the Flat File item reader, there are some more choices and configurations to be made. There are three options you can choose from:
52 +- Pass through line mapper
53 +- Default line mapper
54 +- Pattern matching composite line mapper
52 52
53 -Having done so we can add a file inbound channel adapter to the canvas including an output channel. Ensure that the property reference for the directory matches the one you have used before in the outbound channel adapter. Furthermore link the filter to the component and define the poller according to the best practice.
56 +Each of these options has some advantages and disadvantages. Adhering to the best practices of eMagiz (i.e. no transformation in the entry), the best option is to use the pass-through line mapper. As the name suggests, this option does nothing except give a string back to the flow on a per-line basis. However, choosing this option means that the actual transformation from that string to XML needs to happen later in the process (most likely in the onramp) with the help of a flat-file to XML transformer (more on that component in a later course).
54 54
55 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-archiving--file-inbound-archive-cleanup.png]]
58 +The other two options transform the input line into an XML output, so you win one step in the process. However, no standard eMagiz error handling is available when you start transforming data within the entry. So in case something goes wrong, analyzing the error becomes more difficult. Furthermore, another potential disadvantage is that when one line fails, the processing of the rest of the file also halts.
56 56
57 -One thing we should not forget within this configuration is to set the Max messages per poll on the Advanced tab of the poller configuration to a sufficiently high number (e.g. 50). If you forget to do so and you only check once a day it will mean that only one message will be deleted that day.
60 +For the remainder of this microlearning, we will assume that the option pass through line mapper is chosen.
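The two item reader types described above can be illustrated with a short Python sketch. This is not how eMagiz implements the component. The function names `read_lines_passthrough` and `split_xml_records` and the `strict` parameter are assumptions made for illustration only. The first function mimics the pass-through line mapper by handing back each line as an untouched string. The second mimics splitting XML on a named element and raises an error when `strict` is enabled and the element never occurs.

```python
import xml.etree.ElementTree as ET


def read_lines_passthrough(path, encoding="utf-8"):
    """Yield each line of a flat file as a raw string, one message per line.

    No transformation is applied; the file is streamed, so memory use
    stays low even for large inputs.
    """
    with open(path, encoding=encoding) as handle:
        for line in handle:
            yield line.rstrip("\n")


def split_xml_records(path, element_name, strict=True):
    """Yield one serialized XML fragment per `element_name` element.

    With strict=True a ValueError is raised if the element never occurs.
    """
    found = False
    for _event, element in ET.iterparse(path, events=("end",)):
        if element.tag == element_name:
            found = True
            element.tail = None  # drop whitespace that follows the element
            yield ET.tostring(element, encoding="unicode")
            element.clear()  # release the children of the processed record
    if strict and not found:
        raise ValueError(f"no <{element_name}> element found in {path}")
```

Both functions are generators, so a consumer receives one message at a time instead of the complete file, which matches the per-line and per-record behavior described in this section.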
58 58
59 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-archiving--file-inbound-archive-cleanup-max-messages-per-poll.png]]
62 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--flat-file-item-reader-passthrough.png]]
60 60
61 -Now eMagiz will check on a set time interval whether there are files that are older than three days that are ready for deletion. One last step to go. This last step will ensure that all files that fit the bill will be deleted from the archive. Simply add a standard service activator to the canvas and define the following SpEL expression within the component: payload.delete().
64 +As you can see, on the Basic level we are done. However, it is always good to check out the settings on the Advanced tab, especially in this case, to see if there are additional configuration options that could benefit us. The setting of most interest, in this case, is the Lines to Skip setting (default setting is 0). With this setting, you can define whether or not you want to process the header line(s) that exist within your input file. The remainder of the settings are (in most cases) good the way eMagiz has set them up.
62 62
63 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-archiving--archive-cleanup-deletion.png]]
66 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--flat-file-item-reader-passthrough-advanced.png]]
64 64
65 -This will ensure that each file that is retrieved will indeed be deleted from the archive.
68 +=== 3.2 Poller ===
66 66
70 +Now that we have selected and configured the item reader type, it is time to fill in the last part of the configuration, the poller. For polling, eMagiz offers three options:
71 +
72 +- Fixed Delay Trigger
73 +- Fixed Rate Trigger
74 +- Cron Trigger
75 +
76 +Of these options, the cron trigger is used most frequently in eMagiz. The reason is that you can define this option via a property that you can alter without having to alter the flow version in Create.
77 +
78 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--poller-config.png]]
79 +
80 +After finishing all these configuration steps we can press Save to save our work and ensure that we can process the input file on a per-line basis.
81 +
67 67 == 4. Assignment ==
68 68
69 -Configure an entry in which you build the archiving and the cleanup of the archiving.
84 +Configure an entry in which you define the component and configuration needed to process a file on a per-line basis.
70 70 This assignment can be completed with the help of the (Academy) project that you have created/used in the previous assignment.
71 71
72 72 == 5. Key takeaways ==
73 73
74 -* Archiving is used for audit purposes
75 -* Archiving is used for retry scenarios
76 -* Ensure that data is cleaned after a retention period to keep in control of the data
77 -* Don't forget the max messages per poll
89 +* Easy way of reading a file line by line and sending it to eMagiz (Low on memory)
90 +* Ability to process each line based on distinctive logic that is relevant on line level
91 +* Can be used for flat file as well as XML input files
92 +* Try to avoid complex transformations within the entry
78 78
79 79 == 6. Suggested Additional Readings ==
80 80
81 -If you are interested in this topic and want more information on it please read the help text provided by eMagiz and check out the following store content:
96 +There are no suggested additional readings on this topic
82 82
83 -* [[microlearning>>doc:Main.eMagiz Store.Accelerators.File Archiving.WebHome||target="blank"]]
84 -* [[microlearning>>doc:Main.eMagiz Store.Accelerators.Delete Folders.WebHome||target="blank"]]
85 -
86 86 == 7. Silent demonstration video ==
87 87
88 88 This video demonstrates how you could have handled the assignment and gives you some context on what you have just learned.
89 89
90 -{{video attachment="novice-file-based-connectivity-characterset.mp4" reference="Main.Videos.Microlearning.WebHome"/}}
102 +{{video attachment="novice-file-based-connectivity-processing-a-file-per-line.mp4" reference="Main.Videos.Microlearning.WebHome"/}}
91 91
92 92 )))((({{toc/}}))){{/container}}{{/container}}