Changes for page Volume Mapping (On-premise)
                  Last modified by Erik Bakker on 2024/08/26 12:37
              
      
      From version  29.1 
    
    
              edited by Erik Bakker
        
on 2022/06/10 13:22
     on 2022/06/10 13:22
      Change comment:
              There is no comment for this version
          
         
      To version  57.1 
    
    
              edited by Eva Torken
        
on 2023/06/20 17:02
     on 2023/06/20 17:02
      Change comment:
              There is no comment for this version
          
         Summary
- 
          Page properties (4 modified, 0 added, 0 removed)
Details
- Page properties
- 
      - Title
-   ... ... @@ -1,1 +1,1 @@ 1 - Processinga FileperLine1 +Volume Mapping (On-premise) 
- Author
-   ... ... @@ -1,1 +1,1 @@ 1 -XWiki.e bakker1 +XWiki.etorken 
- Default language
-   ... ... @@ -1,0 +1,1 @@ 1 +en 
- Content
-   ... ... @@ -1,11 +1,13 @@ 1 1 {{container}}{{container layoutStyle="columns"}}((( 2 -In some cases, you want to treat each unique part of your input file as its message instead of processing the complete file as its message. In this microlearning, we will learn how you can process a (large) file on a per-line basis. 3 3 4 -Should you have any questions, please contact [[academy@emagiz.com>>mailto:academy@emagiz.com]]. 3 +{{warning}} 4 +Please note that this microlearning is for the new monitoring stack only. 5 +{{/warning}} 5 5 6 -* Last update: May 28th, 2021 7 -* Required reading time: 5 minutes 7 +When you need to read and write files from an on-premise disk, you need to know the path in which the data is stored and make sure that the docker container in your runtime(s) running has access to this path. There are several ways of dealing with this challenge. First, this microlearning will discuss the various alternatives and best approaches in these scenarios. 8 8 9 +Should you have any questions, please contact [[academy@emagiz.com>>mailto:academy@emagiz.com]]. 10 + 9 9 == 1. Prerequisites == 10 10 11 11 * Basic knowledge of the eMagiz platform ... ... @@ -12,93 +12,117 @@ 12 12 13 13 == 2. Key concepts == 14 14 15 -This microlearning centers around learning how to proces sanincoming file perline.17 +This microlearning centers around learning how to set up your volume mapping correctly so you can exchange file-based data on-premise. 16 16 17 -By processingper line, we mean:Splittingupthe inputdiscernable piecesthateachwillbecomeaunique message19 +By volume mapping, we mean: Creating a configuration through which the docker container can read and write data on a specific path on an on-premise machine. 18 18 19 -* Easy way of reading a file line by line and sending it to eMagiz (Low on memory) 20 -* Ability to process each line based on distinctive logic that is relevant on line level 21 -* Can be used for flat file as well as XML input files 21 +There are several options for volume mapping for your on-premise machine. 22 +* Volume 23 +* Bind mount 24 +* Temporary file system 25 +* Named pipe 22 22 23 -== 3. Processinga FileperLine ==27 +== 3. Volume Mapping (On-premise) == 24 24 25 - In somecases, youwantto treateachuniquepartofyourinputfileasitsmessageinsteadofprocessingthecompletefile as itsmessage.Inthis microlearning,we willlearnhowyou can processa(large) fileonaper-linebasis.29 +When you need to read and write files from an on-premise disk, you need to know the path in which the data is stored and make sure that the docker container in your runtime(s) running has access to this path. There are several ways of dealing with this challenge. First, this microlearning will discuss the various alternatives and best approaches in these scenarios. 26 26 27 -To make this work in eMagiz you need to navigate to the Create phase of eMagiz and open the entry flow in which you want to retrieve the file to a certain location. Within the context of this flow, we need to add functionality that will ensure that each line is read and processed separately and will become its unique message. To do so first enter "Start Editing" mode on flow level. After you have done so please add a file item reader message source to the flow. We will use this component to read and process our input file on a per-line basis. 31 +There are several options for volume mapping for your on-premise machine. 32 +* Machine volume 33 +* Bind mount 34 +* Network volume 35 +* Temporary file system 36 +* Named pipe 28 28 29 - Thefirststepwouldbetodefinethe directoryfromwhichwereadourmessages.Asalwaysreferenceto the directorywith thehelp ofaproperty.38 +Below we will explain the differences between the various options available for your volume mapping. But before we do, we first explain how to set up this configuration within eMagiz. Then, you must navigate to Deploy -> Architecture on the model level. In this overview, you can access the Volume mapping per runtime deployed on-premise. To do so, you can right-click on the runtime to access the context menu. 30 30 31 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity- processing-a-file-per-line--file-item-reader-directory.png]]40 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--volume-option-context-menu.png]] 32 32 33 - Secondly,justaswhenreadingthefileas a whole ensure thatyouusea filter to retrieveonlythe correctfilesfromthedirectory.42 +When you click this option, you will see the following pop-up. In this pop-up, you can define the machine-level, runtime-level volumes and network-level volumes. More on that later. This is the starting point for configuring your volume mapping. We will walk through each available option and explain how they work and should be configured. 34 34 35 - === 3.1Itemr Type===44 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--volume-mapping-pop-up.png]] 36 36 37 -No w itis time to select ourItem reader Type. Asthe helptextof the eMagiz componentsuggesttherearetwochoices with this component.The first (andmost frequentlyused) optionis the Flat file itemreader.With this option,youcan read eachline within the flat file input fileand output isata separate message. Thesecond option is called the Stax event itemreader. With thisoption,youcan readyourinput XML andoutputmessagesonaer-record basis.46 +{{info}}Note that you should be in "Start editing" mode to make any changes to the configuration of your volume mapping.{{/info}} 38 38 39 - [[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--item-reader-type-options.png]]48 +=== 3.1 Volume === 40 40 41 - Basedon your choice the exact configuration willdiffer.50 +The first Type available to you is Volume. With this option, you create one or more folders on Docker relevant to that runtime to read and write **persistent** data. To configure this Type, you need to link the runtime volume to a machine volume (or network volume) you can create within the same pop-up. This means you can re-use a "Machine volume" or a "Network volume" over multiple runtimes (i.e., containers). To do so we first need to define a machine (or network) volume. Once we have done that we can learn how to link the volume to the machine or network volume. 42 42 43 -==== 3.1.1 Stax Event ItemReader====52 +==== 3.1.1 Define Machine Volume ==== 44 44 45 - For theStaxeventitemreader, youneedto definethenameof theelementonwhichyouwanttosplittheXMLanddefinewhetheryou wanttothrowanerrorincasenosuch element existsintheinputfile(By(de)selectingtheoptionStrict).Thedefaultsettingof eMagiz is advisable forthisoption.54 +So, we first open the tab called "Machine volume." Then, by pressing the "New" button, we can define a new "Machine volume." In the pop-up that follows, we can specify the name of a machine volume and tell whether the Volume already exists on your docker installation or not. 46 46 47 -[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity- processing-a-file-per-line--stax-event-item-reader-config.png]]56 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--machine-volumes-configuration.png]] 48 48 49 - ====3.1.2FlatFileItemReader====58 +Once you have done so, we press "Save" and switch back to the "Runtime volumes" tab. 50 50 51 -For the Flat File item reader, there are some more choices and configurations to be made. There are three options you can choose from: 52 -- Pass through line mapper 53 -- Default line mapper 54 -- Pattern matching composite line mapper 60 +==== 3.1.2 Define Network Volume ==== 55 55 56 - Eachoftheseoptionshas someadvantagesanddisadvantages.Adheringto thebestpracticesof eMagiz (i.e. no transformationinthe entry)thebestoption would beto use the pass-throughlinemapper.As thenamesuggests thisoptiondoesnothingexceptgive a string backtotheflowona per linebasis.However,choosingthisoptionmeansthatthe actualtransformation fromthatstring to XML needstohappenlaterin theprocess(mostlikelyin the onramp)withthehelpofaflat-filetoXML transformer(more onthatcomponentinlatercourse).62 +So, we first open the tab called "Network volume." Then, by pressing the "New" button, we can define a new "Network volume." In the pop-up that follows, we can specify the name of a machine volume and configure the relevant information for a network volume. In most cases a CIFS is used and the only relevant options that need to be filled in are the host, port, username and password. 57 57 58 - The other two options transformthe input line intoan XML output. So you winone step in the process.However, no standardeMagiz error handling is advisablewhen you start transformingdata within the entry.So in case, something goes wrong to analyze the error will becomemore difficult. Furthermore,another potential disadvantageisthat whenonelinefails theprocessingf therestf the filealsohalts.64 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--network-volumes-configuration.png]] 59 59 60 - Fortheremainderof thismicrolearning, wewill assumethattheoptionpassthroughline mapper ischosen.66 +Once you have done so, we press "Save" and switch back to the "Runtime volumes" tab. 61 61 62 - [[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--flat-file-item-reader-passthrough.png]]68 +==== 3.1.3 Link Volume ==== 63 63 64 - As you cansee ontheBasic levelwe aredone. However,it isalways good to checkoutthesettingson theAdvanced tab,especiallyin this case,toseeifthereare additionalconfigurationoptions that couldbenefitus. The setting ofmostinterest,inthis case, is theLinestoSkipsetting (default settingis 0). Withthis setting,you can definewhetheror not youwant toprocess theheaderline(s)that existswithin your input file.Theremainderofthesettingsis (in mostcases) goodthe way eMagiz has set themup.70 +In the "Runtime volumes" tab, we push the "New" button to create a new "Runtime volume." In the following pop-up, we first need to select the Type we want to use. For this example, we use the Type called "Volume." 65 65 66 - [[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--flat-file-item-reader-passthrough-advanced.png]]72 +{{info}}Based on your selection, the relevant input fields will change.{{/info}} 67 67 68 - === 3.2 Poller===74 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--runtime-volumes-configuration-type-volume.png]] 69 69 70 - Nowthat wehave selectedandconfiguredtheitemreadertypeitbecomes time tofill in thelastpart of theconfiguration,thepoller. For polling eMagiz offers three options:76 +The first thing we need to select is the "Volume." Once we have chosen our "Volume," we must set the Target specific for this runtime. This target defines the second part of the path to which the runtime will gain access. For example, when you fill in "/target", we can combine this with the "Volume" name to arrive at the correct directory from which eMagiz needs to read data (or write data to). So in our case in which we link the volume to the machine volume we created earlier, this would be "/file-directory/target." 71 71 72 -- Fixed Delay Trigger 73 -- Fixed Rate Trigger 74 -- Cron Trigger 78 +The last setting we need to configure is to define which rights we will grant our runtime on the Volume we are creating. The default setting is read/write rights for the runtime, which is sufficient in most cases. The result of following these steps will be the following. 75 75 76 - Of these options, then trigger is used most frequentlyineMagiz. Thereasonbeingisthatyoucan define thisoptionvia aproperty that youanalter without havingtoalter theflow versionin Create.80 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--runtime-volumes-configuration-type-volume-filled-in.png]] 77 77 78 - [[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-processing-a-file-per-line--poller-config.png]]82 +{{warning}}Note that when you use the Volume option, the external system with which you exchange data on-premise via a file-based method needs to be able to write or read the data from the Volume (i.e., directory) you have configured in Docker. Should this be a problem, the Bind mount alternative discussed below should be considered.{{/warning}} 79 79 80 - Afterfinishingall these configurationsteps we can press Save tosave our work and ensurethatwe can process the input file on a per-line basis.84 +=== 3.2 Bind mount === 81 81 82 - ==4.Assignment==86 +An alternative option to read and write **persistent** data is to use the "Bind mount" option. In general, we advise using the "Volume" option because they have better performance, and bind mounts depend on the host machine's directory structure and OS. However, only some external systems can adapt to this that easily. For example, the "Bind mount" option can interest your use case. 83 83 84 -Configure an entry in which you define the component and configuration needed to process a file on a per-line basis. 85 -This assignment can be completed with the help of the (Academy) project that you have created/used in the previous assignment. 88 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--runtime-volumes-configuration-type-bind-mount.png]] 86 86 87 - ==5.Key takeaways==90 +To configure a "Bind mount," you need to define a source and a target directory linked to each other. The source directory represents the directory on your local system (that might already be used currently to exchange files). The target directory defines a directory on your docker installation that the runtime can access. 88 88 89 -* Easy way of reading a file line by line and sending it to eMagiz (Low on memory) 90 -* Ability to process each line based on distinctive logic that is relevant on line level 91 -* Can be used for flat file as well as XML input files 92 -* Try to avoid complex transformations within the entry 92 +{{warning}}Note that when the source directory does not exist, the runtimes cannot be deployed correctly. One of the following two configurations displayed below are needed to find the source directory: 93 +* /mnt/host/{local-directory} 94 +* /run/desktop/mnt/host/{local-directory}{{/warning}} 93 93 94 - == 6.SuggestedAdditionalReadings ==96 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--runtime-volumes-configuration-type-bind-mount-filled-in.png]] 95 95 96 - Therearenosuggestedadditionalreadings onthistopic98 +{{info}}Note that when you use this option, your directory reference in your flow should refer to the "target" directory configured here.{{/info}} 97 97 98 -== 7.Silent demonstrationvideo==100 +=== 3.3 Temporary file system === 99 99 100 -Th is videodemonstrateshowyoucouldhavehandledtheassignment andgives you some contextonwhatyouhavejust learned.102 +The temporary file system option is for you if you do not want to work with **persistent** data but require **non-persistent** data. This way, you can increase the container's performance by avoiding writing into the container's writable layer. 101 101 102 - {{videoattachment="novice-file-based-connectivity-processing-a-file-per-line.mp4" reference="Main.Videos.Microlearning.WebHome"/}}104 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--runtime-volumes-configuration-type-temp-file-storage.png]] 103 103 106 +To configure this option, you need a target location. On top of that, you can define the maximum size of the temporary file system. 107 + 108 +[[image:Main.Images.Microlearning.WebHome@novice-file-based-connectivity-volume-mapping-on-premise--runtime-volumes-configuration-type-temp-file-storage-filled-in.png]] 109 + 110 +{{warning}}We strongly advise you to define this number so that you can limit the potential impact this solution can have on the stability of your machine.{{/warning}} 111 + 112 +=== 3.4 Named pipe === 113 + 114 +The named pipe option can be selected, but we yet have to see a valid use case within the context of eMagiz for using this option. Therefore we won't discuss this option further in this microlearning. 115 + 116 +== 4. Key takeaways == 117 + 118 +* File-based communication on-premise changes in the new runtime architecture 119 +* There are two ways to store **persistent** data 120 + ** Volume 121 + ** Bind mount 122 +* The Volume option is considered the best alternative because they have better performance, and bind mounts are dependent on the directory structure and OS of the host machine 123 +* When you want to deal with **non-persistent** data, the Temporary file storage option is the way to go. 124 + 125 +== 5. Suggested Additional Readings == 126 + 127 +If you are interested in this topic and want more information, please read the help text provided by eMagiz. 128 + 129 + 104 104 )))((({{toc/}}))){{/container}}{{/container}} 
 
