1 Introduction

Artificial intelligence (AI) is one of the fundamental technologies brought about by the Fourth Industrial Revolution. This technology is present in different fields and has been credited with several ground-breaking products and services that have the potential to change many aspects of society at large (Bengi & Heath, 2020). However, even if there is extended academic research on AI, as well as great interest from industry, governments and individuals, there is not a universal definition of it (JRC, 2018).

AI is a general-purpose technology with widespread applications that are generating important concerns around aspects such as ethics, data protection or Intellectual Property (IP). Regarding the latter, IP policy represents a key aspect for innovation and creativity since it stimulates economic and cultural systems. AI interplays with IP in several ways and has resulted in interesting discussions in recent years. However, most of the attention has been addressed to the questions of authorship and inventorship, leaving aside other relevant IP questions.

This paper aims to focus on the disclosure requirement in the patent realm. Sufficiency of disclosure is one of the fundamental pillars that justify the patent system. It ensures certain transparency of human inventiveness in exchange for which the inventor is granted a time-limited patent right. With AI-related inventions entering the patent system, disclosure may be affected due to its ‘black-box’ character (Früh, 2021), i.e. the algorithmic decision-making of the inventions may be opaque, resulting in reproducibility issues (Waltl & Vogl, 2018), and affecting the validity of sufficiency of disclosure. This leads us to pose the following research question: How could sufficiency of disclosure be improved in the field of AI?

To answer the research question, a combination of research methods has been used, composed of a systematic literature review and empirical analyses consisting of a patent landscape and two case studies of AI-related patents from the EPO. Due to the extended analysis of each case study, it was decided to limit the selection of AI-related patents to a core-AI patent and an AI-application. The selection of the case studies was performed using the database Espacenet.

Regarding the structure, this paper begins by explaining the field of AI together with a patent landscape, focusing on the evolution of AI-related patent applications through time and the areas in which AI is more present and gaining relevance. Secondly, the patentability of AI-related inventions is briefly explained highlighting its particularities and the relevance of the person skilled in the art. Thirdly, the sufficiency of disclosure requirement is addressed from a legal, philosophical and economic perspective showing its relevance for innovation and society but also underlying the challenges that it entails, especially in the field of AI. Fourthly, two case studies of AI-related patents are presented, assessing how the inventions are disclosed and highlighting the heterogeneity of disclosure between both patents. Lastly, conclusions are developed together with three proposals that could help avoid vagueness and heterogeneity while disclosing AI-related inventions.

2 Artificial Intelligence

AI is considered as an emerging technology, due to the important developments that are being done within it (EPO, 2020). Even if AI is attracting attention from different disciplines —e.g., engineering, medicine, economics or law— due to its wide application possibilities, this technology is not new.

AI finds its roots back in the 1940s when it consisted in programs that executed specific instructions by the programmer (Bathaee, 2018), but it was not until the 1950s that AI was introduced as an academic discipline and defined as ‘machines behaving in ways that would be called intelligent if a human were so behaving’ (McCarthy et al., 1955). Since then, AI has evolved largely thanks to other technology developments, such as computer power, and it has moved, for instance, to machine learning (ML) algorithms that ‘can learn, adapt to changes in a problem’s environment, establish patterns in situations where rules are not known, and deal with fuzzy or incomplete information’ (Negnevitsky, 2005).

As abovementioned, even if there is an important interest from different sectors of society in the potential of AI, there is not a universal definition of it (Waltl & Vogl, 2018). Nevertheless, there have been attempts to find a proper legal definition (AI HLEG, 2018; Proposal for the Artificial intelligence Act, 2021). According to WIPO, AI can be understood as a computer science discipline aiming to develop machines and systems that can execute tasks considered to require human intelligence, with minimal or no human intervention (WIPO, 2020). The discipline of AI is in fact a catch-all term that covers a fast-evolving family of technologies including ML,Footnote 1 machine reasoningFootnote 2 and roboticsFootnote 3 (Drexl et al., 2019; AI HLEG, 2018).

AI can contribute to a wide array of industries and social activities by providing competitive advantages —such as optimising operations or improving predictions— that can result in economic and social benefits (Proposal for the Artificial Intelligence Act, 2021).Footnote 4 In this sense, inventors dealing with AI-related inventions are interested in getting patents since, even if AI is based on computational models and mathematical algorithms which are of an abstract nature, AI-related inventions may not be excluded from patentability if they possess technical purposes and they are claimed as such (Art. 52(3) EPC; EPO Guidelines for Examination, section G-II, 3.3.1; Decisions T154/04, T1173/97, T935/97). To show the interest in patenting AI-related inventions, this paper exposes an AI-related patent landscape.

2.1 AI-Related Patent Landscape

To conduct the patent landscaping of AI-related patents, the online available patent database EspacenetFootnote 5 is used. AI-related patents can be identified by the presence of AI-relevant keywords —such as machine learning (ML), AI or fuzzy logic, among othersFootnote 6 (Baruffaldi, et al., 2020). Those relevant keywords are related to the AI technique that is used in the patent,Footnote 7 and its functional application.Footnote 8 Therefore, a search queryFootnote 9 using 18 AI-relevant keywordsFootnote 10 extracted from the WIPO’s AI patent landscape is conducted. For the purpose of this patent landscape, only patents filed until 31 December 2019Footnote 11 are included, due to data reliability. As shown in Fig. 1, AI is already present across all CPC classes and therefore several technological fields, including ‘green technologies’ (Y02 and Y04S).

Fig. 1
figure 1

Distribution of AI-related patents across CPC sections and some relevant technology fields (generated with Tableau)

Regarding the time development of AI-related patent filings, Fig. 2 shows that both, worldwide and European patents, have been on the patent scene for some time, but it is only since 2015 that they have seen an unprecedented increase. This may be due to several aspects that have been highlighted above, such as computer power, but also because of technological advances and research interest in the field of AI (JRC, 2018).

Fig. 2
figure 2

Time development of AI-related patents worldwide (orange) and in Europe (blue) (generated with Tableau)

This section has shown the growing relevance of AI-related patents in Europe and worldwide, as well as the variety of fields that AI has reached, including diverse and heterogeneous areas such as chemistry, transporting, textiles, mechanical engineering and/or green technologies. The increasing importance, as well as the prevalence across technology fields, of AI-related patents underlines the need to research issues related to this type of patent. Moreover, the scant literature regarding disclosure justifies and motivates the scope of this paper, and therefore its focus on disclosure in AI-related patents.

3 Artificial Intelligence and Patent Law

AI has triggered interesting debates and literature on several aspects of patent law. Discussions among experts have particularly focused on inventorship, patentability and non-obviousness issues (Abbot, 2016; Fraser, 2016; Ramalho, 2018; Bengi & Heath, 2020; Balos, 2021; Slowinski, 2021). Based on those matters, it has been questioned whether the current patent system is optimal and up to date to meet the challenges posed by AI.

However, before thinking about amending laws or creating new ones to consider AI developments —which would not consist of either a technology-neutral solution or a realistic one (Ramalho, 2018; Hartmann et al., 2020; Balos, 2021)— it is relevant to assess the current patent system, which already includes specific guidelines to assess AI-related inventions in IP offices (e.g. EPO Guidelines for Examination G-II, 3.3.1) and has been able to deal with AI-related issues (Decision J0008/20 DABUS). In this paper, we focus on the EPC definitions and EPO procedure, even if other patent systems may be mentioned.

3.1 Patentability of AI-Related Inventions at the EPO

A patent is a territorial exclusive right granted for an invention, in all fields of technology, which consists of a product or process that is new, involves an inventive step and is capable of industrial application (Art. 27 TRIPS). To benefit from a patent right, the inventor must disclose technical information to the public through the patent application (Art. 29 TRIPS).

According to the EPO, inventions involving AI are considered ‘computer-implemented inventions’ (CII) which entail computers, computer networks or other programmable devices, whereby a feature is performed by means of a program (EPO Guidelines for Examination, F-IV, 3.9). Under the EPC, a CII is considered a patentable subject matter if it is not excluded from patentability (Art. 52 EPC) and fulfils the patentability requirements of novelty (Art. 54 EPC; Decisions T12/81 and T198/84), inventive step (Art. 56 EPC; Decision T154/04) and susceptibility of industrial application (Art. 57 EPC; Decision T144/83). This approach also applies to AI-related inventions (EPO Guidelines for Examination, G-II, 3.3.1).

The presence of the patentability requirements is assessed by patent examiners who are considered as persons ‘skilled in the art’ (Art. 56 EPC; EPO Guidelines for Examination G-VII, 3; Decision T641/00), also known as ‘PHOSITA’Footnote 12 (Meara, 2002; Fabris, 2020; Slowinski, 2021). In that sense, it is important to bear in mind that with CII, and therefore AI-related inventions, it is common to find mixed-type inventions, i.e. inventions with claims comprising technical and non-technical featuresFootnote 13 (EPO Guidelines for Examination G-II, 2; Decisions T26/86; T769/92; T641/00; T531/03; T154/04; T1784/06). In particular, with AI-related inventions, according to the EPO, the use of terms such as ‘support vector machine’ or NN may refer to abstract models and algorithms that do not imply the use of technical means but may be part of a technical solution. It has been suggested that due to this aspect, and the particularities that AI-related inventions entail, determining who is ‘skilled in the art’ could be brought into question. This would affect both the assessment of the inventive step and sufficiency of disclosure, since the required level of skill for both aspects is alike (Ramalho, 2018; Schellekens, 2022).

4 The Rationales of Sufficiency of Disclosure

4.1 Legal and Philosophical Rationales

With the advent of AI-related patents and their extensive and potential use in diverse fields, the social contract theory that justifies the patent system underlines its informational purpose. This theory emphasises the role that the patent system plays in achieving information uncovering since through patent disclosure, it is possible to define the scope of protection of the invention, provide society with a survey of the state-of-the-art and facilitate further developments, as well as know-how licensing (Ramalho, 2018; Beier & Straus, 1977; Kitch, 1977). Likewise, the disclosure requirement possesses important legal justifications, for enforcement and litigation purposes, since depending on how the patent is disclosed, it may be invalidated (Fromer, 2009; Decisions G2/93, G2/93, G2/98).

Disclosure is a universal precondition for the granting of a patent (Art. 29.1 TRIPS). It refers to the incentive to reveal the invention to society through the patent application, instead of keeping the invention as a trade secret (Roffe, 1974; Osborn, 2019). In return, inventors obtain exclusive rights for their inventions.

According to patent law, sufficiency of disclosure is present when a patent application allows the reproduction of the intended technical solution. In that sense, an invention may be sufficiently disclosed even if certain elements are not explicitly described by the applicant, but the examiner must be able to fill in the missing information to reach reproducibility (Art. 83 EPC; Decision T61/14; Früh, 2021). Art. 83 EPC’s requirement aims to bring transparency to the description of the inventive process in the claims, thus enabling reproducibility. However, the rationale of this requirement has been largely discussed before AI, presenting arguments against it —such as its inefficiency or the fact that it disincentivises patent applications (Früh, 2021; Devlin, 2010; Burk & Lemley, 2002).

Sufficiency of disclosure must be assessed taking the application as a whole — including the description and claims— (Decisions T14/83, T169/83). However, the claims are a key element through which the patentee specifies the scope of the patent (Chakroun, 2020). Even if the claims should be drafted in terms of the technical features of the invention, their wording must be clear and concise (Art. 84 EPC; Decision T988/02). Accordingly, only a reasonable degree of clarity is required, since the patent language is examined by the person skilled in the art, considering the prior art and context of the invention at the time of its conception (Chakroun, 2020; Decisions T2001/12, T1180/14). As a result, insufficient disclosure will occur if the skilled person cannot rework the invention following the teaching of the patent (Decision T432/10).

The ‘person skilled in the art’ and the ‘reasonable degree of clarity’ requirement consist of two aspects that entail considerable consequences since the level of knowledge of the examiner in some cases is difficult to determine (Hartmann et al., 2020) and it may be brought into question with AI-related patents (Ramalho, 2018; Schellekens, 2022). According to the examiner’s perspective, some elements may be considered obvious —and would therefore not need to be included in the patent claims— but they may constitute relevant information for users to understand the invention, avoid infringement, but also exploit it when it enters the public domain (Chakroun, 2020).

Despite the necessity of clarity, contemporary patent applications rarely are concise or disclose all the information required for the proper exploitation of the patent (Wernick, 2021). This tendency may be due to different aspects, such as technology complexity (Ebrahim, 2020), the use of specific technical jargon —so-called ‘patentese’ (Seymore, 2010)— but also to strategies for effective litigation (Trop, 1988; Bessen & Meuer, 2005).

In this sense, given that patent claim drafting requirements do not demand exhaustive clarity, patent examiners may grant patents entailing issues regarding claim construction or vagueness (Alison et al., 2010). This issue may be underlined by the black-box character of AI-related inventions, since it may become challenging to provide clear and sufficient disclosure for inventions and therefore be carried out by the skilled person (Hartmann et al., 2020).

As an attempt to amend the issue of vagueness, it has been suggested by scholars to introduce the requirement of a ‘claim chart’Footnote 14 together with the patent application, to help patent examiners, or users that may be interested in —or in the need of— interpreting patent claims (Churnet, 2013). This type of solution would not consist of legal reform and therefore would not entail a long and tedious process but could be adopted in the form of guidelines or policy (Ramalho, 2018). This proposal could be highly beneficial for AI-related inventions and their ‘explainability’ and could also serve as a step towards a standard process for patent claims construction.

4.2 Economics’ Rationales

The economics’ rationales of disclosure find their roots in the fact that through the dissemination of patent information, it may be possible to foster innovation. Access to patent information has proven to increase innovative activities and help research and development (R&D) managers execute innovative projects (Furman et al., 2021; Cohen et al., 2002). Moreover, the clarity of how the information is disclosed plays a major role, since patents with a higher quality of disclosure generate more follow-on innovation (Dyer et al., 2021) and favour competition, as they enable competitors to learn about new inventions and encourage them to create substitutes for the patented products or processes (Wernick, 2021). Therefore, higher quality of disclosure stimulates the patent system (Chakroun, 2020; Fromer, 2009).

Although evidence suggests that there are positive effects in patent disclosure, studies focusing on the effects of disclosure in Europe are to our knowledge scant, and even more, if we look for studies dealing with disclosure of AI-related patents. Nevertheless, empirical studies in other jurisdictions, such as the US, have shown that through stricter disclosure requirements or different aspects in the patent system that affect disclosure like the existence of a grace period (EPO, 2022), it may be possible to increase innovative activities and improve the scientific innovation flow (Baruffaldi & Simeth, 2020; Franzoni & Scellato, 2010; EPO, 2022).

Despite the positive effects of disclosure, its compliance faces two main challenges. Firstly, complying with disclosure is a demanding task for patent examiners, and secondly, patent applicants have incentives to be vague in their patent applications. Applicants know that their information may be useful for competitors, and this may incentivise patent applicants to draft claims clearly enough for patent offices to grant the patent application, but unclear enough from the competitors’ lens (Arinas, 2012). Patent disclosures are a strategic element in firms’ innovative activities (Baker & Mezze, 2005), and firms are aware that their private information is a source of market opportunities and competitive advantages (Barbaroux, 2014). In that sense, drafting vague claims can be seen as a business strategy to establish a balance between the requisites to obtain patent protection and the patentee’s commercial advantage (Chakroun, 2020). The employment of deliberate vagueness in claims as a strategic use concerns the development function of the patent system, failing to inform the public of the real scope of the invention, to foster innovation and competitiveness, thus affecting the informative and teaching value and the quality of the patent system (Drexl et al., 2021; Chakroun, 2020). Apart from strategic incentives, economic policy uncertainty and changes in disclosure policies may increase the willingness of patent applicants to be vague (Kim, 2019; Amore, 2020).

Concerning patent examiners, they may confront challenges when determining appropriate prior art for patent applications dealing with mixed-type technologies (such as AI-related inventions), in contrast to ‘discrete technologies’, like chemistry, which are more straightforward. Moreover, in the case of mixed-inventions, it is sometimes difficult to distinguish which specific element in the complex system is in the patent scope (Tan & Roberts, 2010). Patent examiners themselves may also be a source of heterogeneity when complying with the disclosure requirement. Evidence suggests that there are ‘disclosure-lenient examiners’, meaning that some are less strict than others, have lower patent rejection rates and are more likely to accept less clear patents having fewer figures, worse score in disclosure metrics and NLP readability (Dyer et al., 2021). Moreover, specific circumstances like higher workloads and personal characteristics of examiners (e.g. experience) may cause heterogeneous patent examination processes, leading to granting patents that do not fulfil equally patentability requirements (Kim et al., 2017; Cockburn et al., 2002).

5 Disclosing Artificial Intelligence: Case Studies

Despite its importance, the concept of sufficiency of disclosure in AI has not been treated with the same attention as other aspects under the patent law prism (Früh, 2021).

However, being aware of the particularities that AI entails, the EPO has developed specific guidelines of examination for this field. According to those, AI is based on computational models and algorithms meaning that they possess an abstract mathematical nature even if they can be trained based on training data. The use of specific terms such as ‘support vector machine’ or NN may refer to abstract models and algorithms that do not imply the use of technical means but also can be part of mixed-inventions. Apart from core-AI inventions, the EPO highlights that AI finds applications in several fields that can represent technical contributions. In some cases, providing the steps of generating a training set in the patent application may contribute to the technical character of the invention (EPO Guidelines for Examination, section G-II, 3.3.1).

When an applicant files a patent, it is then subject to an examination done by a patent examiner following the guidelines for examination. As part of this procedure, a search report is issued, containing citationsFootnote 15 that establish relevant prior art, which plays a role while evaluating novelty and inventive step of a patent application (Loveniers, 2018). These documents, according to the EPO Boards of Appeal (Decision T0466/09), can be considered as previous technical literature which is generally common knowledge for the person skilled in the art, and can be used to assess disclosure. Documents cited in the search report can be of any type (e.g., research papers) if they possess relevant information for the patent grant. For the analysis of disclosure, the focus lies on the following type of citations:

  • X: relevant documents for inventive step and novelty

  • Y: relevant documents for novelty

  • A: documents giving the general state-of-the-art

  • D: documents cited in the application

To show how differently sufficiency of disclosure can be executed regarding AI-related patent applications, two case studies, both granted patents by the EPO, are presented:

  1. 1.

    EP3432107A1: ‘Cleaning robot and controlling method thereof’, classified under CPC G05D 1/02 and G06T 7/00 application of an AI-technique to a home appliance

  2. 2.

    WO2018094294A1: ‘Spatial Attention Model for image captioning’, classified under CPC G06N 3/04, core-AI, granted at the EPO with the publication number EP3542314B1

Methodology

The two selected patents were found using Espacenet. For the first patent, the selected search criteria are the presence of AI-related keywords and keywords related to machines or home appliances. This enables finding patents that consist of applications of AI techniques. For the second patent, the selected search criterion was looking for patents classified under CPC G06N,Footnote 16 which according to the Cartography of the Fourth Industrial Revolution (EPO, 2017) represent patents that are classified as core-AI. Both patents are granted, and their selection is motivated by the fact that albeit both patents successfully passed the examination process, they exhibit differences in the clarity of disclosure.

The first patent (hereinafter Patent 1) is an application of an AI technique, whereas the second (hereinafter Patent 2) is a core-AI patent. The reasoning for this selection is that depending on the type of AI-related patent (whether core-AI or the application of an AI-technique), nuances regarding disclosure can be detected, which will be explained in detail in this section.

The structure of the case studies will be as follows. Both case studies start with a brief description of the invention. Secondly, an analysis of the prior art documents included in the search report is conducted, as these documents are essential to assess the quality of disclosure of a patent. Finally, considering the results of the case studies, a summary of how AI-related patents should be disclosed is proposed.

5.1 Case Study 1: EP3432107A1, ‘Cleaning Robot and Controlling Method Thereof’

The European patent application EP3432107A1,Footnote 17 filed by LG Electronics, for Cleaning robot and controlling method thereof, classified under G05D 1/02 and G06T 7/00, describes a vacuum cleaner that recognises obstacles and performs autonomous travelling.

The patent includes fifteen claims. The first three claims specify the use of training data and algorithms: ‘a controller (1800) configured to detect a difference between first (701) and second image (702) consecutively capture by the camera (130, 1400) generate a third image (704) based on the detected difference, and analyze the generated third image (704) using the preset algorithm’.

Moreover, claim 3 refers to the training data, mentioning ‘the cleaner of claim 2, wherein the memory (1700) is configured to store training data for the analysis of the image, and the controller (1800) is configured to determine a degree of similarity between a plurality of image information including the training data and third image’.

In the following claims, there is neither mention of the type of algorithm that is being used, nor if it is an AI-based algorithm.Footnote 18 Nevertheless, given the information in the description, it can be assumed that it is a ML algorithm. ‘The controller 1800 performs a role of processing information based on an artificial intelligence technology […] the controller 1800 may use a machine learning technology’.

The reference to AI continues with statements such as ‘The controller 1800 may use training data stored in an external server or a memory, and may include a learning engine for detecting a characteristic for recognizing a predetermined object’. In patent’s Fig. 6, represented by Fig. 3 in this paper, and the summary of the invention, there is mention of the fact that the analysis is done using a deep learning algorithm: ‘Furthermore, an object of the present disclosure is to provide a cleaner for combining monitoring traveling with a deep learning algorithm to perform autonomous traveling’.

Fig. 3
figure 3

Patent’s Fig. 6 describing the algorithm that the vacuum cleaner uses, mentioning the aim for which the deep learning algorithm is implemented

Also, from the patent’s Fig. 9, represented by Fig. 4 in this paper, it can be concluded that the algorithm is used for object detection and image classification:

Fig. 4
figure 4

Patent’s Fig. 9 describing the aim of the deep learning algorithm in object detection

As there is no mention, nor specific description of the algorithm, the next step would be to verify if the documents cited on the search reportFootnote 19 include information in this regard. The European search report includes two Y citations and two A citations, consisting of patents. Focusing on A citations (i.e., citations relevant to prior art), two US patents are cited:

  • Patent US2009161911A1: ‘Moving object apparatus and method’ discloses a method to detect moving objects. To this aim, the patent presents a method that while capturing several images at different points in time can detect moving objects and changes in the images using temporal differencing and distance maps to obtain information regarding the moving object.

  • Patent US2011243383A1, ‘Image processing device method and program’ discloses a method that detects changes in images, by targeting differences between pixels in the analysed images.

Despite that both patents analyse differences between several images and explain the algorithms and techniques that are used, there is no mention of AI-based algorithms.

Regarding the Y citations referred to in the European search report, there is a German patent and an international patent:

  • The German patent DE10200501582A1 consists of a method comparing two images in the context of semiconductor components.

  • The international patent WO2016005011A1 consists of a vacuum cleaner that also compares two images to improve travelling by using a method based on a ratio of displacement.

Similarly, to the A citations, none of these documents mentions if AI-techniques are being used.

Even if deep learning comprises several techniques and networks, like convolutional NN, and recurrent NN (Pouyanfar et al., 2018), there is no mention of which type of these techniques and networks the disclosed technology makes use of. This is relevant as in object detection and image classification, using deep learning, several techniques can be applicableFootnote 20 (Kaushal et al., 2018; Dargan et al., 2020).

5.2 Case Study 2: WO2018/094294A1, ‘Spatial Attention Model for Image Captioning’

The international patent application WO2018/094294A1,Footnote 21 classified under G06N 3/04 and filed by Salesforce, describes a system able to generate captions for a given image. The applicant specifies at the beginning of the document that the field of technology disclosed relates to AI, specifically to image captioning. The relevance of the invention, and its research field, lies in aiding visually impaired users and navigating through large amounts of data that are typically unstructured. This highlights the importance of generating high-quality captions for images. Hence, this type of AI-application combines both elements of computer vision and NLP.

The patent includes 22 claims, most of them devoted to explaining an image caption-generating system that leverages several AI-techniques. Moreover, the patent includes 25 figures, some of them devoted to comparing prior AI-algorithms and the disclosed algorithm as well as tables providing metrics to assess the quality of the technology disclosed. To this aim, state-of-the-art databases and metrics are used.

Claim 1 summarises the elements composing the system:

  1. 1.

    An encoder for processing the image through a convolutional neural network (abbreviated CNN) and producing image features for regions of the image

  2. 2.

    A global image feature generator for generating a global image feature […]

  3. 3.

    An input preparer […]

  4. 4.

    The decoder for processing the input through a long short-term memory network (abbreviated LSTM) […]

  5. 5.

    An attender for accumulating at each decoder timestep, an image context as a convex combination of image features scaled by attention probability masses […]

  6. 6.

    A feed-forward neural network for processing the image context and the current decoder hidden state to emit a next caption word […]

  7. 7.

    A controller for iterating the input preparer, the decoder, the attender and the feed-forward neural network to generate the natural language caption for the image until the next caption word emitted in an end-of-caption token

In the detailed description of the patent, there are subsections and figures describing each of the elements mentioned in claim 1, and further claims. According to the disclosed technology and documents cited in the search report, image captioning systems contain an encoder-decoder and an attention-model, and the applicant mentions that ‘an opportunity arises to improve the performance of attention-based models’. The inventors explain that while generating words for the caption, not all words have visual signals, e.g., ‘The words “a” and “of” do not have corresponding canonical visual signals’, to this aim, linguistic correlation could be used when generating so-called stop words.Footnote 22 The improvement opportunity lies in determining ‘the importance that should be given to the target image during caption generation’, as there are visual and non-visual words. This means that the main improvement made in this image captioning system is the attention-model.

These models determine which parts of the image are needed to focus on when generating a word and where and when to attend (Lu et al., 2017). Figure 2A, B from the patent, represented by Fig. 5, show differences in the attention decoder from the prior art and disclosed technology which is one of the elements included in the disclosed attention-model.

Fig. 5
figure 5

Patent’s Fig. 2A, B comparing prior art with the disclosed technology

The patent also includes in the drawings tables that show the performance of the disclosed technology using the Flicker30k and COCO datasetsFootnote 23 and the improvements when compared to the previous state-of-the-art. The metrics included are rank probabilities and NLP metrics such as BLEU (Bilingual Evaluation understudy),Footnote 24METEOR (Metric for evaluation of evaluation with explicit ordering),Footnote 25CIDEr (Consensus-based image description evaluation),Footnote 26ROUGE-L (Recall-oriented understudy for Gisting evaluation)Footnote 27 and SPICEFootnote 28 (Semantic Propositional Image Caption Evaluation).Footnote 29 Patent’s Fig. 24 (represented by Fig. 6 in this paper) shows a state-of-the-art leader board that compares the disclosed technology with other image captioning generating systems:

Fig. 6
figure 6

State-of-the-art leader board presented in the patent, comparing the disclosed technology with other well-known models in the field

Image captioning systems based on deep learning can have different architectures and use different approaches (Hossarin et al., 2019; Lu et al., 2017; Kiros et al., 2014). The international search report includes three A citations (consisting of two research papers and a patent from Google) and three X citations, all research papers. The research papers cited in the search report are all attention-based models with encoder-decoder frameworks. This means that the cited documents are in a specific area of research of image captioning models, making it clear in which way the disclosed technology adds value and improves this research field. The X citations consist of:

  • Xu et al. (2015),Footnote 30 which describes a caption-generating system with an encoder that uses a CNN and a decoder that uses a LSTM. The authors test two attention models ‘stochastic hard attention’ and ‘deterministic short attention’. To test the quality of these models, they also use the Flickr30k and COCO databases, which are summarized in Table 1.

  • Chen et al. (2017)Footnote 31 describes a captioning-generating system and aims at testing how different attention-models improve the quality of the generated captions. In this sense, they distinguish between semantic attention-models, which also use semantic information, and image attention-models which depend solely on visual features. The attention-models that the authors test rely on exploding several attention types. The results are also summarised in Table 1. The last paper is an A citation,Footnote 32 which also refers to an attention-based model, but as other metrics are provided, they are not included in Table 1.

Table 1 Comparison between disclosed technology and prior art documents using state-of-the-art metrics and databases

Continuing with A citations:

  • Kiros et al. (2014) is also a paper referring to an attention-based model, but as other metrics are provided, they are not included in Table 1.

  • Merity et al. (2016) refers to the linguistic model included in the image captioning system. This linguistic model can predict the following word given the linguistic context. This is one key component of the image captioning system disclosed.

  • International patent WO 2016/077797A1, filed by Google, describes an image captioning system but there is no mention of if it is attention-based.

To summarise, the improvements of the image captioning model are made in the attention-model, which is clearly explained and described. As for elements that are not part of the improvements proposed by the patent, like the CNN, the inventors mention the use of a pre-trained ResNET CNN. This enables the person skilled in the art to know which CNN should be used from the variety of CNN architectures available (e.g. AlexNet, VGG, GoogleNET). As a result, the person skilled in the art can find all the necessary information, even if it is not available in the patent.

5.3 Discussion: Lessons from the Case Studies

Through both case studies, it is possible to observe differences based on the way both patents are disclosed and were granted by the EPO. In that sense, these case studies serve to research to determine heterogeneity in disclosure by just showing two examples of AI-related patents.

While reading the patents, without considering documents in the search report, it can be easily noticed how differently both are disclosed. Patent 2 includes a long description of the technology, as well as quantitative metrics, which in this case are quality metrics to evaluate the image captioning system with respect to prior art. According to Dyer et al. (2021), specific pieces of information like quantities and percentages can be considered a disclosure metric that serves to assess how well a technology is disclosed. On the other hand, patent 1 has an extremely vague and short description, and it does not provide specific pieces of information.

Of the two patents presented, one is an application of an AI technique (Patent 1) and the other is a core-AI patent (Patent 2). NNs are a good example to explain the meaning of core-AI, since their contribution can consist of (1) the network itself (connections, weights, and training data); (2) the application in a specific context or (3) general concepts that can be applied to other networks (e.g. multiperceptron and back propagation algorithms; Watkin & Rau, 1996). Core-AI represents cases 1 and 3, in which improvements to the AI-technique are made.

Even if Patent 1 is not classified under G06N (core-AI), there is a mention of the use of a deep learning algorithm, but there are no further explanations regarding the architecture and type of such algorithm. In this case, as the EPO Boards of Appeal have already determined (Decision, T0161/18), it is questionable whether the person skilled in the art could be able to reproduce the disclosed technology without having any mention of how the deep learning algorithm is constructed, trained nor the type of data used. Mentioning this, for example, by citing relevant prior art, could be helpful, even if the patent does not make any improvement to the deep learning technique as such.

On the other hand, Patent 2 clearly discloses the area of AI to which it is contributing to, and the documents cited in the search report are all related to the relevant prior art of attention-models for image captioning. In this patent, the use of prior art is twofold: it serves as a benchmark to show how the patent improves the prior art, and it also serves to disclose elements relevant to the invention that are prior art. Elements that are novel and represent an improvement of an AI technique must be properly described in the patent application.

Figure 7 provides a summary of these conclusions:

Fig. 7
figure 7

Summary of disclosure in AI-related patents (generated with Visio)

These two case studies have shown how the quality of disclosure varies in both patents. This could lie in the fact that for certain technologies, it is easier to determine the prior art (Tan & Roberts, 2010), which could also justify the difference in the sufficiency of disclosure between both patents. Regarding Patent 2, there is a large amount of academic literature explaining image captioning models, their types and the appropriate techniques to evaluate their quality. However, this is not the case for Patent 1.

Nevertheless, the home appliance mentioned in Patent 1 uses a deep learning algorithm for object detection and image classification, and these areas also have wide academic literature and evaluation metrics available. If possible, it may be useful to explain why this specific algorithm is used, e.g. by using object detection metrics (Karasulu, 2010), since the evaluation of object detection models on benchmark datasets plays an important role in the validation of an algorithm (Kaushal et al., 2018).

Following Ouellette (2012), patent literature should be more in line with scientific literature in the way results and inventions are disclosed. For instance, both Patent 2 and the image captioning papers serve as an example of how researchers in the field must explain prior art they built upon, as well as the procedures followed to demonstrate the improvement of the AI-based techniques.

These two case studies aim at showing that not all granted patents fulfil the disclosure requirement in the same way. Therefore, it highlights the importance of studying more in-depth the quality of compliance with this requirement. This paper has only presented two case studies but are of substantial importance since they are illustrative examples of the underlined problem and therefore worth bringing to the surface in detail. Nevertheless, even if extending this type of study to evaluate a representative number of patents is complex, it would be highly advisable for further research. Lastly, the EPO could evaluate a representative sample of granted patents to assess the quality of disclosure, to better guarantee the fulfilment of this requirement.

6 Conclusions and Considerations

AI-related patent filings have been growing significantly in recent years, which highlights the importance of this technological field. With the increase of such patents, certain issues regarding sufficiency of disclosure are of great importance from the perspective of IP offices, but also the users’ one.

Legal, philosophical rationales and empirical economic studies justify and show the positive effects that disclosure has on innovation and society, but also the fact that patent applicants have incentives to be vague on their patent claims.

Vagueness in patent claims affects the benefits of the patent system, especially the function of sufficiency of disclosure. This issue may be highlighted by AI-related patents and their presence in several diverse fields due to their black-box character. Intentional vagueness in the wording of a patent application may be due to the strategic relevance that a claim may have. In that sense, patent claims have been described as sentences that can be worth millions and ensure a company’s financial success or even its ruin (Jakobsen Osenga, 2006).

Regarding the research question posed by this paper: How could sufficiency of disclosure be improved in the field of AI? AI-related patents are already present in several technological fields and reaching a disclosure standard applicable to all technologies could be challenging, since there is no common consensus on a standard process for patent claim construction (Chakroun, 2020). Therefore, establishing disclosure standards to simplify the compliance of disclosure is not straightforward. This paper has intended to show that issue by presenting two case studies. Both showed how differently and heterogeneous the disclosure requirement can be carried out in AI-related patents at the EPO.

From the IP offices’ perspective, in this case, the EPO, it could be recommended to require applicants to provide a detailed description, or a higher standard, in terms of explanation and disclosure of the logical processes involved in AI-related patents. This could help patent examiners to avoid black box cases (EPO, 2018), which represent the inability to fully understand ‘an AI’s decision-making process and the inability to predict the AI’s decisions or outputs’ (Batahee, 2018). While requiring a higher standard may benefit the quality of granted patents, it may also be more costly and time-consuming and thus discourage AI-related patent applications. Likewise, to obtain an accurate perspective of how disclosure is taking place in the AI-related patent realm, it could be advisable for the EPO to evaluate a representative sample of granted patents to assess the quality of disclosure.

As for the users’ and society’s perspective, it is important to bear in mind that the disclosure requirement is not just a question of getting the patent accepted, but also of enabling reproducibility and being able to enforce it in case of litigation. The technical standard of an examiner is obviously higher than the one of the ordinary public, but when it comes to enforcement, the judge must be able to understand how the patent works and proving infringement might be challenging, hence the need to avoid black box situations (EPO, 2018).

The conducted analysis does not aim at giving a final solution to the issue of disclosure of AI-related patents but intends to highlight the relevance of the topic and the need of further research on it. In that sense, to provide a proper solution, the involvement of persons skilled in the art of each AI field should be required, since they are the ones able to determine all the necessary information to be considered. However, the results of this research allow the authors to suggest some solutions that may be useful to improve the sufficiency of disclosure requirement in the field of AI-related patents.

The first solution concerns AI applications. Despite technological fields being large, functional applications are much more limited, being the most frequent ones: character recognition, computer vision, object detection and speech processing (WIPO, 2019). The specific cases of image captioning models, as well as object detection, have developed metrics to assess the quality of an algorithm. A first step and suggestion in achieving a disclosure standard would be to analyse the availability of metrics and benchmark datasets for all functional applications. In cases where core-AI is improved, a benchmark analysis could be presented. If no improvements on core-AI are made, then refer to the model being implemented and justify the selection of it.

The second solution regards patent claims drafting. As an attempt to amend the issue of vagueness in patent claims, it has been suggested by scholars to introduce the requirement of a ‘claim chart’ together with the patent application, to help patent examiners, but also users that may be interested —or in the need— of interpreting patent claims (Churnet, 2013; Chakroun, 2020). This solution could also serve as a step towards a standard process for patent claims construction.

Lastly, the third possible solution involves the public. Back in 2010, the USPTO developed a peer review patent programFootnote 33 in which the public was encouraged to submit relevant prior art for pending patents, using the assistance of experts in each functional application.

Those proposals could help to ‘bridge the gap between the person skilled in the art’ (Früh, 2021) and therefore ensure that patent offices, applicants and society have equal access to patent information and promote innovation.