Changes for page Impact of changing topic retention
Last modified by Erik Bakker on 2024/09/03 08:22
From version 14.1
edited by Carlijn Kokkeler
on 2024/08/27 15:13
Change comment:
There is no comment for this version
To version 10.1
edited by Erik Bakker
on 2022/10/03 13:15
Change comment:
There is no comment for this version
Summary
-
Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
- Author
@@ -1,1 +1,1 @@
-XWiki.CarlijnKokkeler
+XWiki.ebakker
- Content
@@ -1,5 +1,5 @@
 {{container}}{{container layoutStyle="columns"}}(((
-In this microlearning, we'll look at the impact of changing retention policies for topics across time. A retention policy defines at what thresholds data is deleted from a topic.
+In this microlearning, we'll take a look at the impact of changing retention polices for topics across time. A retention policy defines at what thresholds data is deletedf from a topic.
 
 Should you have any questions, please get in touch with [[academy@emagiz.com>>mailto:academy@emagiz.com]].
 
@@ -10,54 +10,55 @@
 
 == 2. Key concepts ==
 
-Retention configuration in Event Streaming helps to delete data from a topic once that data (messages) reaches one of the retention thresholds. The primary retention parameters that can be influenced in Event Streaming topics are:
+Retention configuration in Event Streaming helps to delete data from a topic once that data (messages) reaches one of the retention thresholds. The main retention parameters that can be influenced in Event Streaming topis are:
 
-* Retention bytes -> once the topic reaches this size, the oldest messages will be deleted from the topic (FIFO approach)
+* Retention bytes -> once the topic reaches this size the oldest messages will be deleted from the topic (FIFO approach)
 * Retention hours -> once a message on a topic is older than these retention bytes, this message will be deleted
 
-The cleanup policy is set to <delete>, which means the message is deleted.
+The cleanup policy is set to <delete> which effectively means the message is deleted. The cleanup policy <compact> is not used in eMagiz.
 
-{{info}}The cleanup policy <compact> is not used in eMagiz.{{/info}}
-
 == 3. Considerations for topic retention policy ==
 
 === 3.1 When does the retention kick in? ===
 
-Retention will only start when the topic size is larger than 220Mb for topics less than 200Mb. The so-called segment size is automatically calculated and is considered for topics larger than 220Mb. However, data may still need to be deleted when the topic size is larger than the retention size configured.
+Retention will only start when the topic size is larger than 220Mb for topics that are less than 200Mb. The socalled segment size is automatically calculated and is considered for topics larger than 220Mb. It may occur that data is not yet deleted at the moment the size of the topic is larger than the retention size configured.
 
 === 3.2 Does the size in Design Architecture consider the segment size? ===
 
-The total size in eMagiz displayed in the Design phase * Architecture considers the complex calculations around segment sizing, cleanup policies, etc. So your sizing can always be at most that value on our Event Streaming cluster. eMagiz uses this value to determine the required licensing.
+The total size in eMagiz that is displayed in the Design phase * Architecture takes into account the complex calculations around segment sizing, cleanup policies etc. So your sizing can never exceed that value on our Event Streaming cluster. eMagiz uses this value to determine the required licensing.
 
 === 3.3 Actual retention vs. budget retention ===
 
-Although you set a retention hours setting, the messages can be so large that the retention bytes kick in earlier than the retention hours. This could cause an effect that the oldest messages are younger than your retention hour setting. In specific scenarios, this can be an issue.
+Despite the fact that you set a retention hours setting, it may occur that the amount of messages is so large that the retention bytes kicks in earlier than the retention hours. This could cause an effect that the oldest messages is younger than your retention hour setting. In specific scenario's this can be an issue.
 
 === 3.4 Managing retention across TAP ===
 
-You can set the retention bytes and hours per Test, Acceptance, and Production. You can play with these configuration values by selecting a smaller value for Test than Production. So that each topic is optimized for the environment at hand. One piece of advice can be to keep Acceptance and Production similar in terms of configuration for retention, whereas the Test can be set to a lower value.
+You are able to set the retention bytes and hours per Test, Acceptance and Production. You can play with these configuration values by setting a smaller value for Test compared to Production. So that each topic is optimized for the environment at hand. One advice can be to keep Acceptance and Production similar in terms of configuration for retention whereas Test can be set to a lower value.
 
-{{info}}During developing a new business process that involves topics, only the Test can be given sufficient retention values. In this case, Acceptance and Production topics can be disabled so that the total licensed size is not exceeded.{{/info}}
+{{info}}During development of a new business process that involves the use of topics, only test can be given sufficient retention values. In this case, Acceptance and Production topics can be disabled so that the total licensed size is not exceeded.{{/info}}
 
 === 3.5 Duplication of data ===
 
-* Currently, the standard setting for the number of partitions is one. If you change this value to two or three and keep your retention bytes at the same level, you will have reserved data storage twice as much as before. Consequently, you might run out of the allotted size too fast for comfort. Increasing the number of partitions should result from additional consumer groups with multiple consumers wanting to consume data on a particular topic.
+* At this moment the standard setting for the number of partitions is 1. In case you change this value to 10 and keep your retention bytes at the same level, you will have a reserved data storage that is 10 times larger as before. As a consequence you might run out of allotted size to fast for your own comfort.
 
-* When designing solutions in eMagiz using Event Streaming, make sure to make the right decision on storing intermediate results. For example, you could store an intermediate result of a process step in the following topic that then, in turn, feeds the next step in the business process. In this case, using too many topics for the same business process causes an exponential increase in data storage. Try to make a sound decision by combining steps or looking at alternatives.
+* When designing solutions in eMagiz using Event Streaming, make sure to make the right decision on the storage of intermediate results. You could store an intermediate result of a process step in a next topic that then in turn feeds the next step in the business process. In this case, it might be that the use of too many topics for the same business process causes an exponential increase of data storage. Try to make a sound decision in this by combining steps or by looking at alternatives.
 
-=== 3.6 Increase Partitions ===
+== 4. Assignment ==
 
-When you have to increase your partitions either due to an increase in consumers within a consumer group or to achieve higher throughput in general, there is a migration path you ought to consider. To prevent deletion from kicking in before consumers consume data, we advise first increasing the number of partitions and keeping the retention bytes stable. Consequently, you will temporarily need increased topic storage to facilitate the migration in the safest way possible. Once the data is nicely distributed over the partitions and the consumers have consumed enough data not to lose data before it is consumed, you should divide the current retention bytes setting by the number of partitions to which you have increased that setting. So, for example, if your present retention bytes is 3000 and you increase the number of partitions to three, the new retention bytes setting should be 1000 after the migration is finished.
+Take a moment to review your Event Streaming solution and find the Design Architecture section. Review the additional readings to ensure the management of topics and their retention settings are clearly understood.
 
-== 4. Key takeaways ==
+== 5. Key takeaways ==
 
-* There are two ways to influence the retention of a topic - hours & bytes
+* There are 2 ways to influence the retention of a topic - hours & bytes
 * Deletion of data in a topic based on retention policy is somewhat complex - be sure to understand the size of a topic before deletion kicks in
-* Be careful in configuring topics so that data is not duplicated unnecessarily and each environment has the right retention policy
+* Be careful in configuring topics so that data is not duplicated unnecessary and each environment has the right retention policy
 
-== 5. Suggested Additional Readings ==
-* [[Intermediate (Menu)>>doc:Main.eMagiz Academy.Microlearnings.Intermediate Level||target="blank"]]
-** [[Key concepts Event Streaming (Navigation)>>doc:Main.eMagiz Academy.Microlearnings.Intermediate Level.Key concepts Event Streaming||target="blank"]]
-*** [[Understanding retention policy in eMagiz>>doc:Main.eMagiz Academy.Microlearnings.Intermediate Level.Key concepts Event Streaming.intermediate-key-concepts-emagiz-event-streaming-retention-policy||target="blank"]]
-* [[Understanding topic configured size across environments>>doc:Main.eMagiz Academy.Microlearnings.Intermediate Level.Solution Architecture.intermediate-solution-architecture-topic-storage||target="blank"]]
-* [[Key concepts of Event Streaming>>doc:Main.eMagiz Academy.Fundamentals.fundamental-event-streaming-introduction||target="blank"]])))((({{toc/}}))){{/container}}{{/container}}
+== 6. Suggested Additional Readings ==
+
+* [[Understanding retention policy in eMagiz>>doc:Main.eMagiz Academy.Microlearnings.Intermediate Level.Key concepts Event Streaming.intermediate-key-concepts-emagiz-event-streaming-retention-policy.WebHome||target="blank"]]
+* [[Understanding topic configured size across environments>>doc:Main.eMagiz Academy.Microlearnings.Intermediate Level.Solution Architecture.intermediate-solution-architecture-topic-storage.WebHome||target="blank"]]
+* [[Key concepts of Event Streaming>>doc:Main.eMagiz Academy.Fundamentals.fundamental-event-streaming-introduction.WebHome||target="blank"]]
+
+== 7. Silent demonstration video ==
+
+As this is a more theoretical microlearning, we have no video that accompanies this microlearning.)))((({{toc/}}))){{/container}}{{/container}}
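
The partition arithmetic described in sections 3.5 and 3.6 of the page content can be sketched in a few lines of Python. This is a minimal illustration and not eMagiz functionality: the function names are hypothetical, and the sketch assumes that the retention bytes setting applies to each partition separately, which is what the page implies when it says that adding partitions multiplies the reserved storage.

```python
def reserved_storage(retention_bytes: int, partitions: int) -> int:
    """Total storage reserved for a topic.

    Assumption: retention bytes is enforced per partition, so the
    reserved storage grows linearly with the number of partitions.
    """
    if retention_bytes < 0:
        raise ValueError("retention_bytes must not be negative")
    if partitions < 1:
        raise ValueError("a topic needs at least one partition")
    return retention_bytes * partitions


def retention_bytes_after_migration(current_retention_bytes: int,
                                    new_partitions: int) -> int:
    """Retention bytes to configure once the partition increase is finished.

    Follows the advice in section 3.6: divide the current setting by the
    new number of partitions. Floor division keeps the total at or below
    the original budget when the value does not divide evenly.
    """
    if new_partitions < 1:
        raise ValueError("a topic needs at least one partition")
    return current_retention_bytes // new_partitions


# Example from section 3.6: retention bytes 3000, partitions raised from 1 to 3.
before = reserved_storage(3000, 1)             # 3000
during = reserved_storage(3000, 3)             # 9000, the temporary extra storage
new_setting = retention_bytes_after_migration(3000, 3)  # 1000
after = reserved_storage(new_setting, 3)       # 3000, back to the original budget
print(before, during, new_setting, after)
```

Under these assumptions the topic temporarily reserves three times its original storage while the retention bytes are kept stable, and returns to the original budget once the setting is divided by the new partition count.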