Yeast Related - Subtopic Landscape

A synthetic biology perspective

The subset of SynBio – Yeast related patents were further investigated to identify subtopics and assess trending areas. The topic model leverages a hybrid approach based on the optimised extractive summary for each publication. Using a combination of topic discovery via fine-tuned transformer based deep learning and ground truth cross referencing via keyword and classification codes. The process enables a patent to belong to more than one topic for accurate multi-classification trends, accounting for multiple invention embodiments. Please see the topic model page for further details regarding the topic model methodology to avoid duplication here.

Subtopic counts

The identified subtopics within the yeast related topic are shown in figure 1. The topics are ranked based on the total number of published applications since 2015. A patent application can be counted more than once as it can belong to multiple topics.

In figure 1, the classification of documents across the top 5 subtopics are as follows; genetic engineering (52.8%), enzyme (compositions & genetic engineering) (45.9%), gene editing (29.8%), food (26.7%) and recombinant proteins & nucleic acids (23.6%). It is interesting to note that yeast are widely used in food applications with more than 25% of documents classified here. Yeasts have also been widely used in biofuels and in particular to produce ethanol and engineered for waste processing and conversion by feeding on waste biomass and feedstocks. This also applies to oleaginous yeast which are ideal candidates for biofuel production by producing microbial lipids.

The yeast related subtopic publication year trends are shown in figure 2. Publication trends discussed below are based on EP A1/A2 applications, identified patents can belong to more than one subtopic due to multiple invention embodiments.

In figure 2, the fastest growing subtopics identified of note and based on compound annual growth rates during 2016-24 are Saccharomyces (25.7%), alternative proteins (17.4%), food (16.6%), antibody uses/therapeutics (13.2%), genetic engineering (12.3%), gene editing (11.7%), enzyme (compositions & genetic eng. etc.) (11.5%) and fusion proteins (11.4%) above the 10% growth rate threshold.

Subtopic top 20 assignees distributions (published since 2015)

The patent portfolios of the top 20 assignees within the SynBio – Yeast related dataset are analysed in figure 3. The portfolios are restricted to publications during 2015-24, mapped to the subtopics identified, the counts represent total EPO publications.

The heatmap in figure 3, reveals the distribution of the top 20 assignees since 2015, publications can be assigned to more than one subtopic, reflecting multiple invention embodiments. Regarding biofuels, major assignees utilising yeast include DANISCO (50 publications) and NOVOZYMES (43 publications) where it appears ethanol is an important focus. DSM IP ASSETS is focused towards enzyme related applications (76 publications) with some focus towards food. LESAFFRE is the second largest assignee in the food related subtopic. There are also assignees such as IMMATICS and LONZA making use of yeast for antibody production, etc.

The analysis does not account for earlier publications prior to 2015, which may have contributed to companies developing market share, etc. and potential licensing and acquisitions (subsidiaries). The analysis is an informative guide as some specific subtopics have strict content boundaries to enable differentiation, whilst others are broader to capture more generic areas.