CRISPR - Subtopic Landscape

A synthetic biology perspective

The subset of SynBio – CRISPR related patents were further investigated to identify subtopics and assess trending areas. The topic model leverages a hybrid approach based on the optimised extractive summary for each publication. Using a combination of topic discovery via fine-tuned transformer based deep learning and ground truth cross referencing via keyword and classification codes. The process enables a patent to belong to more than one topic for accurate multi-classification trends, accounting for multiple invention embodiments. Please see the topic model page for further details regarding the topic model methodology to avoid duplication here.

Subtopic counts

The identified subtopics within the CRISPR topic are shown in figure 1. The topics are ranked based on the total number of published applications since 2015. A patent application can be counted more than once as it can belong to multiple topics.

The classification of documents across the top 5 subtopics are as follows; gene editing (100%), enzyme (compositions & genetic eng. etc.) (86.4%), interfering nucleic acids (74.5%), gene therapy (47.3%) and engineered cells e.g. stem cells (46.6%). CRISPR is viewed as a potential tool for treating diseases reflected in the 4th placed ranking of gene therapy (47.3%). By correcting deleterious mutations or disrupting specific genes, CRISPR could deliver precision and efficiency for treating disease. The UK is the first country in the world to approve CRISPR based gene editing as a potential cure for two inherited blood disorders. The remaining subtopics reveal the uses of CRISPR based gene editing, etc. for various genetic engineering applications including the engineering of microorganisms. Due to the emerging nature of the technology there is no change in the rankings between the two time periods.

The CRISPR subtopic publication year trends are shown in figure 2. Publication trends discussed below are based on EP A1/A2 applications, identified patents can belong to more than one subtopic due to multiple invention embodiments.

In figure 2, the fastest growing subtopics identified of note and based on compound annual growth rates during 2016-24 are chimeric antigen receptor (64.6%), t-cell related (60.2%), cancer (50.8%), viruses and bacteriophages (45.9%), Interfering nucleic acids (41.8%), mammalian cells (41.4%) and genetic eng. – viruses (41.3%), which are all above the 40% threshold.

Subtopic top 20 assignees distributions (published since 2015)

The patent portfolios of the top 20 assignees within the SynBio – CRISPR dataset are analysed in figure 3. The portfolios are restricted to publications during 2015-24, mapped to the subtopics identified, the counts represent total EPO publications.

The heatmap in figure 3, reveals the distribution of the top 20 assignees since 2015, publications can be assigned to more than one subtopic, reflecting multiple invention embodiments. The US based research institutions are top assignees active across the major CRISPR subtopic areas. Gene therapy is also a focus for many leading assignees such as the largest company sector assignees EDITAS MEDICINE, CRISPR THERAPEUTICS and REGENERON. Many of the leading assignees are also using CRISPR to fight cancer. CRISPR THERAPEUTICS has expertise in the chimeric antigen receptors subtopic. Beyond MIT and HARVARD, PIONEER HI-BRED INT are a leading company establishing a CRISPR-CAS breeding platform to develop seeds, etc. INSCRIPTA has specific expertise in the genetic engineering of bacteria, especially E.coli, utilising CRISPR based gene editing for the editing of genomes at scale for applications such as biomanufacturing.

The analysis does not account for earlier publications prior to 2015, which may have contributed to companies developing market share, etc. and potential licensing and acquisitions (subsidiaries). The analysis is an informative guide as some specific subtopics have strict content boundaries to enable differentiation, whilst others are broader to capture more generic areas.