{"id":16763,"date":"2024-05-07T15:22:09","date_gmt":"2024-05-07T13:22:09","guid":{"rendered":"https:\/\/www.kickmaker.fr\/blog\/?p=16763"},"modified":"2024-05-07T15:22:43","modified_gmt":"2024-05-07T13:22:43","slug":"harnessing-synthetic-data-a-cornerstone-of-ai-strategy","status":"publish","type":"post","link":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/","title":{"rendered":"Harnessing synthetic data: a cornerstone of AI strategy"},"content":{"rendered":"[vc_row type=&#8221;in_container&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/1&#8243; tablet_width_inherit=&#8221;default&#8221; 
animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][image_with_animation image_url=&#8221;16308&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;Fade In&#8221; animation_easing=&#8221;default&#8221; animation_movement_type=&#8221;transform_y&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;&#8221; border_radius=&#8221;none&#8221; box_shadow=&#8221;none&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;100%&#8221; max_width_mobile=&#8221;default&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;in_container&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; 
gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/1&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text text_direction=&#8221;default&#8221;]Implementing artificial intelligence in industrial settings is a widespread aspiration. However, the journey is riddled with formidable challenges.<\/p>\n<p>Practitioners often encounter the disheartening reality of witnessing promising outcomes in academic research, only to confront significant degradation when attempting real-world application.<\/p>\n<p>Numerous factors contribute to this discrepancy, ranging from the relative infancy of Deep Learning theory to the complexities of identifying robust model axes suitable for industrial deployment.<\/p>\n<p>Yet, amidst these challenges, a consistent bottleneck emerges: the scarcity of quality data, which invariably halts progress. 
Overcoming these limitations demands a rigorous methodology that supports iterative improvement of Deep Learning solutions, for both training and model evaluation.<\/p>\n<p>In this article, we present strategies rooted in synthetic data and introduce a robust methodology designed to keep the work manageable amidst these complexities.[\/vc_column_text][\/vc_column][\/vc_row][vc_row type=&#8221;in_container&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/1&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; 
column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_row_inner column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; text_align=&#8221;left&#8221; row_position=&#8221;default&#8221; row_position_tablet=&#8221;inherit&#8221; row_position_phone=&#8221;inherit&#8221; overflow=&#8221;visible&#8221; pointer_events=&#8221;all&#8221;][vc_column_inner column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; overflow=&#8221;visible&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/1&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text text_direction=&#8221;default&#8221;]<strong>Contents :<\/strong><\/p>\n<ol>\n<li><a href=\"#il-n-y-a-jamais-assez-de-donn\u00e9e\">Never enough data<\/a><\/li>\n<li><a href=\"#une-solution-evidente\">One obvious solution: generate data<\/a><\/li>\n<li><a href=\"#les-approches-naives\">Naive approaches are bound to fail<\/a><\/li>\n<li><a href=\"#une-methodologie-indispensable\">An indispensable 
methodology<\/a><\/li>\n<\/ol>\n[\/vc_column_text][\/vc_column_inner][\/vc_row_inner][\/vc_column][\/vc_row][vc_row type=&#8221;in_container&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; id=&#8221;il-n-y-a-jamais-assez-de-donn\u00e9e&#8221; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221; shape_type=&#8221;&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/1&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text el_id=&#8221;il-n-y-a-jamais-assez&#8221; text_direction=&#8221;default&#8221;]\n<h2>Never enough data<\/h2>\n[\/vc_column_text][divider 
line_type=&#8221;No Line&#8221;][vc_column_text text_direction=&#8221;default&#8221;]Anyone who has ever taken an interest in the subject will be aware of this problem: in AI, we need an inordinate amount of data. &#8220;Always more&#8221; is an axiom that can quickly become exhausting.<\/p>\n<p>In the academic world, major players such as Google, with its JFT dataset, have accumulated hundreds of millions of images to train their networks.<br \/>\nHowever, holding such a large quantity of data is very often out of reach for an industrial player.<\/p>\n<p>It should also be remembered that, to be usable, data must be annotated by a human. Even if we can greatly accelerate this annotation, it must be rigorously controlled by experts.<br \/>\nThis is because the annotations serve as much to train (create) our Deep Learning models as to qualify their predictive quality.<br \/>\nIn the latter case, data annotated too quickly will lead to a model that gives the illusion of working well, until the fateful day of industrialization when the model collapses.<\/p>\n<p>Be careful, however, as the aim is not simply to accumulate lots of data. The data must be varied enough to reproduce the distribution of the problem we wish to address, covering a sufficient variety of cases with a reasonable balance.<br \/>\nYet, whatever the industrial subject, the cases where we lack data are typically those where acquisition is most complex or costly. Examples: rare cases, cases linked to the operation of an industrial process that&#8217;s costly to interrupt, etc.<\/p>\n<p>The task then quickly seems impossible, and may lead us to look for other ways of improving an AI model. 
For example, by working on a new Deep Learning architecture (such as a new detection backbone), or by tuning hyper-parameters (via a Bayes+HyperBand approach, for example, or along the lines of Tensor Programs V).<\/p>\n<p>These areas for improvement are real, but in our experience, they generally yield much smaller gains than richer, better-controlled data.[\/vc_column_text][divider line_type=&#8221;No Line&#8221;][image_with_animation image_url=&#8221;16303&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;Fade In&#8221; animation_easing=&#8221;default&#8221; animation_movement_type=&#8221;transform_y&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;center&#8221; border_radius=&#8221;none&#8221; box_shadow=&#8221;none&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;100%&#8221; max_width_mobile=&#8221;default&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;in_container&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; id=&#8221;une-solution-evidente&#8221; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221; shape_type=&#8221;&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; 
phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/1&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text text_direction=&#8221;default&#8221;]\n<h2>One obvious solution: generate data<\/h2>\n[\/vc_column_text][divider line_type=&#8221;No Line&#8221;][vc_column_text text_direction=&#8221;default&#8221;]One solution, which has emerged in recent years in the scientific world and has been tried with varying degrees of success in industry, is to generate data to enlarge datasets.<\/p>\n<p>This approach has become a research staple, particularly since OpenAI&#8217;s work on &#8220;domain randomization&#8221;, in which researchers trained a robotic agent in a fully simulated environment and then applied it to the real world (<a href=\"https:\/\/arxiv.org\/abs\/1703.06907\">Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World, Tobin et al.<\/a>). 
More recently, researchers from Carnegie Mellon and Berkeley have used these approaches to train a legged robot to move in an open environment (<a href=\"https:\/\/arxiv.org\/abs\/2106.05963\">Coupling Vision and Proprioception for Navigation of Legged Robots, Fu et al.<\/a>); see the diagram below:[\/vc_column_text][divider line_type=&#8221;No Line&#8221; custom_height=&#8221;20&#8243;][image_with_animation image_url=&#8221;16235&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;Fade In&#8221; animation_easing=&#8221;default&#8221; animation_movement_type=&#8221;transform_y&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;&#8221; border_radius=&#8221;none&#8221; box_shadow=&#8221;none&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;100%&#8221; max_width_mobile=&#8221;default&#8221;][vc_column_text text_direction=&#8221;default&#8221;]Robotics has seized on this approach, but problems such as feature detection and localization, or anomaly detection, can also benefit from these methodologies.<\/p>\n<p>It could even be argued that we&#8217;re getting closer to the <a href=\"https:\/\/en.wikipedia.org\/wiki\/Self-supervised_learning\" target=\"_blank\" rel=\"noopener\">Self-Supervised Learning<\/a> paradigm, which aims to train a model on more general data to learn low-level representations, and then specialize it on a particular subject.<\/p>\n<p>In particular, this approach would make it possible to test a model more extensively: on the one hand, by targeting controlled variances in the generated data and observing the model&#8217;s results; and on the other, by maximizing the amount of real data used in testing, which makes the resulting metrics statistically more robust.<\/p>\n<p>So, for an industrial problem, is generating synthetic data enough to overcome these difficulties?<\/p>\n<p>Sadly, and unsurprisingly, no.[\/vc_column_text][divider line_type=&#8221;No Line&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;in_container&#8221; 
full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; id=&#8221;les-approches-naives&#8221; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221; shape_type=&#8221;&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/1&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text text_direction=&#8221;default&#8221;]\n<h2>Naive approaches are bound to fail<\/h2>\n[\/vc_column_text][divider line_type=&#8221;No Line&#8221;][vc_column_text text_direction=&#8221;default&#8221;]We&#8217;ve already seen this. 
A piece of data, such as a photo-realistic image, is generated with its annotations. An AI model is trained on this synthetic data and seems to deliver good scores.<\/p>\n<p>But when applied to real data, the results collapse dramatically.<br \/>\nWe are back to the demon of distribution drift, so familiar in Deep Learning&#8230; A decisive element in the search for a solution comes from an important work published in 2021 by <a href=\"https:\/\/arxiv.org\/abs\/2106.05963\">Baradad et al.: Learning to See by Looking at Noise<\/a>.<\/p>\n<p>In this publication, the authors illustrate the <em>Representation Learning<\/em> intuition that a network learns hierarchical representations of information.<br \/>\nThey show that it is possible to pre-train a model on images that have very little to do with the target problem (in this case, noise images, or out-of-topic generations), provided that these training images help the neural network build useful fundamental representations.[\/vc_column_text][divider line_type=&#8221;No Line&#8221;][image_with_animation image_url=&#8221;16238&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;Fade In&#8221; animation_easing=&#8221;default&#8221; animation_movement_type=&#8221;transform_y&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;&#8221; border_radius=&#8221;none&#8221; box_shadow=&#8221;none&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;100%&#8221; max_width_mobile=&#8221;default&#8221;][vc_column_text text_direction=&#8221;default&#8221;]The basic idea is this: we don&#8217;t want a synthetic dataset that merely simulates the target problem. We want a hierarchy of synthetic data in which we can embed the target problem, which then becomes a special case of our generation. 
With a solid methodology in place, we&#8217;ll be able to iterate on concrete improvements to the model.[\/vc_column_text][divider line_type=&#8221;No Line&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;in_container&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; id=&#8221;une-methodologie-indispensable&#8221; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221; shape_type=&#8221;&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/1&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text text_direction=&#8221;default&#8221;]\n<h2>An indispensable 
methodology<\/h2>\n[\/vc_column_text][divider line_type=&#8221;No Line&#8221;][vc_column_text text_direction=&#8221;default&#8221;]How do you implement such a project? Methodology is, unsurprisingly, the sinews of war if we are to guarantee our customers a genuine improvement of the model.<\/p>\n<p><strong>1. Understanding the distribution<\/strong><\/p>\n<p>A dataset is basically a sample drawn from a wider distribution, that of the problem we want to address. A distribution can be analyzed, measured and mapped using available data. An initial analysis of this distribution, based on the data and combined with business expertise in the targeted problem, is essential to any subsequent action.<\/p>\n<p><strong>2. Embedding the distribution<\/strong><\/p>\n<p>Let&#8217;s face it: we won&#8217;t be able to create data that&#8217;s totally indistinguishable from the missing data, so there&#8217;s no point wasting months on a quest for perfection that&#8217;s doomed to failure. Even if you can&#8217;t see any visual difference between your synthetic data and your real data, a Deep Learning model will be perfectly capable of exploiting &#8220;invisible&#8221; differences linked to a sensor model, a form of noise linked to optics, and so on. So it&#8217;s best to nest the target distribution within a properly framed hierarchical approach, along the lines of\u00a0Baradad et al.<\/p>\n<p><strong>3. Controlling generation<\/strong><\/p>\n<p>Generation that can&#8217;t be finely controlled is of limited interest. Sooner or later, we&#8217;ll want to generate specific cases corresponding to &#8220;holes&#8221; in the data.<br \/>\nBut we can only discover these special cases by testing the model against real or synthetic data and identifying the variances where the model is at fault.<\/p>\n<p>So let&#8217;s leave aside Generative Adversarial Networks or VQ-Variational Autoencoders, at least initially, in favor of a fully controlled deterministic system. 
It will always be possible, at a later stage, to use these tools sparingly, for example for domain transfer, or for conditioned generation, as in the case of recent diffusion models.<\/p>\n<p><strong>4. Having a compass<\/strong><\/p>\n<p>Probably the most important point. No one wants to wait months for synthetic data, only to find that it does absolutely nothing to improve the model. We work with our customers to determine a &#8220;compass&#8221;, in other words, a model that is easy to train and linked to the target data, on which an improvement can be observed. This makes it possible to iterate and check that the synthesis work is going in the right direction.<\/p>\n<p><strong>5. Adapting training<\/strong><\/p>\n<p>Synthetic data has a dual purpose: to pre-train a model, and to complement training on specific cases. Model training must therefore be weighted according to the neural network architecture and objective, in order to optimize final model quality.[\/vc_column_text][\/vc_column][\/vc_row][vc_row type=&#8221;in_container&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; 
tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/1&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text text_direction=&#8221;default&#8221;]Today, too many projects are blocked by a lack of data. Even when a sufficient dataset has been accumulated to train a first model, iterating to improve that model requires identifying the missing variances and addressing them. A synthetic generation tool thus enables us to incrementally improve a model by improving the training data, just as it enables us to test an AI model properly by probing each variance in the data in isolation.<\/p>\n<p><strong>In other words, synthetic data is an excellent way of moving beyond the proof of concept (POC) and industrializing these new tools.<\/strong>[\/vc_column_text][divider line_type=&#8221;No Line&#8221;][vc_column_text]Article written by <a href=\"https:\/\/www.linkedin.com\/in\/eric-debeir-a99bb994\/\">Eric Debeir<\/a>[\/vc_column_text][\/vc_column][\/vc_row]\n<noscript class=\"ninja-forms-noscript-message\">\n\tNotice: JavaScript is required for this content.<\/noscript>\n<div id=\"nf-form-2-cont\" class=\"nf-form-cont\" aria-live=\"polite\" aria-labelledby=\"nf-form-title-2\" aria-describedby=\"nf-form-errors-2\" role=\"form\">\n\n    <div class=\"nf-loading-spinner\"><\/div>\n\n<\/div>\n        <!-- That data is being printed as a workaround to page builders reordering the order of the 
scripts loaded-->\n        ","protected":false},"excerpt":{"rendered":"<p>[vc_row type=&#8221;in_container&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; overlay_strength=&#8221;0.3&#8243; 
gradient_direction=&#8221;left_to_right&#8221;&#8230;<\/p>\n","protected":false},"author":2,"featured_media":16966,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[204,311],"tags":[285,360,352,362],"class_list":{"0":"post-16763","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-ai","8":"category-article-en","9":"tag-ai","10":"tag-artifical-intelligence","11":"tag-software-en","12":"tag-synthetic-data-en"},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Harnessing synthetic data: a cornerstone of AI strategy - Kickmaker<\/title>\n<meta name=\"description\" content=\"Quelles sont les diff\u00e9rentes approches pour appliquer les travaux d&#039;intelligence artificielle \u00e0 l&#039;industrie ? La synthetic data...\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Harnessing synthetic data: a cornerstone of AI strategy - Kickmaker\" \/>\n<meta property=\"og:description\" content=\"Quelles sont les diff\u00e9rentes approches pour appliquer les travaux d&#039;intelligence artificielle \u00e0 l&#039;industrie ? 
La synthetic data...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/\" \/>\n<meta property=\"og:site_name\" content=\"Kickmaker\" \/>\n<meta property=\"article:published_time\" content=\"2024-05-07T13:22:09+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-05-07T13:22:43+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.kickmaker.fr\/blog\/wp-content\/uploads\/2024\/03\/synthetic-data-and-ai-kickmaker.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1350\" \/>\n\t<meta property=\"og:image:height\" content=\"900\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Alys\u00e9e Flaut\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Alys\u00e9e Flaut\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"10 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/\"},\"author\":{\"name\":\"Alys\u00e9e Flaut\",\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/#\\\/schema\\\/person\\\/04615529c56f5e2bbf8802761d9a884e\"},\"headline\":\"Harnessing synthetic data: a cornerstone of AI strategy\",\"datePublished\":\"2024-05-07T13:22:09+00:00\",\"dateModified\":\"2024-05-07T13:22:43+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/\"},\"wordCount\":3217,\"commentCount\":0,\"image\":{\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/wp-content\\\/uploads\\\/2024\\\/03\\\/synthetic-data-and-ai-kickmaker.webp\",\"keywords\":[\"AI\",\"artifical intelligence\",\"software\",\"synthetic data\"],\"articleSection\":[\"AI\",\"Article\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/\",\"url\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/\",\"name\":\"Harnessing synthetic data: a cornerstone of AI strategy - 
Kickmaker\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/wp-content\\\/uploads\\\/2024\\\/03\\\/synthetic-data-and-ai-kickmaker.webp\",\"datePublished\":\"2024-05-07T13:22:09+00:00\",\"dateModified\":\"2024-05-07T13:22:43+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/#\\\/schema\\\/person\\\/04615529c56f5e2bbf8802761d9a884e\"},\"description\":\"Quelles sont les diff\u00e9rentes approches pour appliquer les travaux d'intelligence artificielle \u00e0 l'industrie ? La synthetic data...\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/wp-content\\\/uploads\\\/2024\\\/03\\\/synthetic-data-and-ai-kickmaker.webp\",\"contentUrl\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/wp-content\\\/uploads\\\/2024\\\/03\\\/synthetic-data-and-ai-kickmaker.webp\",\"width\":1350,\"height\":900,\"caption\":\"synthetic data and 
AI\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Accueil\",\"item\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Harnessing synthetic data: a cornerstone of AI strategy\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/\",\"name\":\"Kickmaker\",\"description\":\"Hightech product industrialization brain food\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/#\\\/schema\\\/person\\\/04615529c56f5e2bbf8802761d9a884e\",\"name\":\"Alys\u00e9e Flaut\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/aaa717f85056a19d443e6750dfad64414b9a59742ead9e7e378f13cb07bb453d?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/aaa717f85056a19d443e6750dfad64414b9a59742ead9e7e378f13cb07bb453d?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/aaa717f85056a19d443e6750dfad64414b9a59742ead9e7e378f13cb07bb453d?s=96&d=mm&r=g\",\"caption\":\"Alys\u00e9e Flaut\"},\"url\":\"https:\\\/\\\/www.kickmaker.fr\\\/blog\\\/author\\\/alysee\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Harnessing synthetic data: a cornerstone of AI strategy - Kickmaker","description":"Quelles sont les diff\u00e9rentes approches pour appliquer les travaux d'intelligence artificielle \u00e0 l'industrie ? 
La synthetic data...","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/","og_locale":"en_US","og_type":"article","og_title":"Harnessing synthetic data: a cornerstone of AI strategy - Kickmaker","og_description":"Quelles sont les diff\u00e9rentes approches pour appliquer les travaux d'intelligence artificielle \u00e0 l'industrie ? La synthetic data...","og_url":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/","og_site_name":"Kickmaker","article_published_time":"2024-05-07T13:22:09+00:00","article_modified_time":"2024-05-07T13:22:43+00:00","og_image":[{"width":1350,"height":900,"url":"https:\/\/www.kickmaker.fr\/blog\/wp-content\/uploads\/2024\/03\/synthetic-data-and-ai-kickmaker.webp","type":"image\/webp"}],"author":"Alys\u00e9e Flaut","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Alys\u00e9e Flaut","Est. 
reading time":"10 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/#article","isPartOf":{"@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/"},"author":{"name":"Alys\u00e9e Flaut","@id":"https:\/\/www.kickmaker.fr\/blog\/#\/schema\/person\/04615529c56f5e2bbf8802761d9a884e"},"headline":"Harnessing synthetic data: a cornerstone of AI strategy","datePublished":"2024-05-07T13:22:09+00:00","dateModified":"2024-05-07T13:22:43+00:00","mainEntityOfPage":{"@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/"},"wordCount":3217,"commentCount":0,"image":{"@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/#primaryimage"},"thumbnailUrl":"https:\/\/www.kickmaker.fr\/blog\/wp-content\/uploads\/2024\/03\/synthetic-data-and-ai-kickmaker.webp","keywords":["AI","artifical intelligence","software","synthetic data"],"articleSection":["AI","Article"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/","url":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/","name":"Harnessing synthetic data: a cornerstone of AI strategy - 
Kickmaker","isPartOf":{"@id":"https:\/\/www.kickmaker.fr\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/#primaryimage"},"image":{"@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/#primaryimage"},"thumbnailUrl":"https:\/\/www.kickmaker.fr\/blog\/wp-content\/uploads\/2024\/03\/synthetic-data-and-ai-kickmaker.webp","datePublished":"2024-05-07T13:22:09+00:00","dateModified":"2024-05-07T13:22:43+00:00","author":{"@id":"https:\/\/www.kickmaker.fr\/blog\/#\/schema\/person\/04615529c56f5e2bbf8802761d9a884e"},"description":"Quelles sont les diff\u00e9rentes approches pour appliquer les travaux d'intelligence artificielle \u00e0 l'industrie ? La synthetic data...","breadcrumb":{"@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/#primaryimage","url":"https:\/\/www.kickmaker.fr\/blog\/wp-content\/uploads\/2024\/03\/synthetic-data-and-ai-kickmaker.webp","contentUrl":"https:\/\/www.kickmaker.fr\/blog\/wp-content\/uploads\/2024\/03\/synthetic-data-and-ai-kickmaker.webp","width":1350,"height":900,"caption":"synthetic data and AI"},{"@type":"BreadcrumbList","@id":"https:\/\/www.kickmaker.fr\/blog\/harnessing-synthetic-data-a-cornerstone-of-ai-strategy\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Accueil","item":"https:\/\/www.kickmaker.fr\/blog\/"},{"@type":"ListItem","position":2,"name":"Harnessing synthetic data: a cornerstone of AI 
strategy"}]},{"@type":"WebSite","@id":"https:\/\/www.kickmaker.fr\/blog\/#website","url":"https:\/\/www.kickmaker.fr\/blog\/","name":"Kickmaker","description":"Hightech product industrialization brain food","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.kickmaker.fr\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.kickmaker.fr\/blog\/#\/schema\/person\/04615529c56f5e2bbf8802761d9a884e","name":"Alys\u00e9e Flaut","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/aaa717f85056a19d443e6750dfad64414b9a59742ead9e7e378f13cb07bb453d?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/aaa717f85056a19d443e6750dfad64414b9a59742ead9e7e378f13cb07bb453d?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/aaa717f85056a19d443e6750dfad64414b9a59742ead9e7e378f13cb07bb453d?s=96&d=mm&r=g","caption":"Alys\u00e9e 
Flaut"},"url":"https:\/\/www.kickmaker.fr\/blog\/author\/alysee\/"}]}},"_links":{"self":[{"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/posts\/16763","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/comments?post=16763"}],"version-history":[{"count":11,"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/posts\/16763\/revisions"}],"predecessor-version":[{"id":16969,"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/posts\/16763\/revisions\/16969"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/media\/16966"}],"wp:attachment":[{"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/media?parent=16763"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/categories?post=16763"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.kickmaker.fr\/blog\/wp-json\/wp\/v2\/tags?post=16763"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}