Jump to content

EN:AI: Difference between revisions

From IP7 Wiki
No edit summary
No edit summary
 
(7 intermediate revisions by the same user not shown)
Line 1: Line 1:
Overview of AI features within the Compass: <br/>
Overview of AI features within the Compass: <br/>


* [[EN:Text_search#Semantic_Search|AI supported semantic search]]
* [[EN:Text_search#AI_-_Full-Text_Search|AI supported semantic search]]
* [[EN:Folders#Folders_and_AI|AI and folders - automatic creation and filling of folders]]
* [[EN:Folders#Folders_and_AI|AI and folders - automatic creation and filling of folders]]
* [[EN:AI_Innovator|AI Innovator - question a result list (LLM)]]
* [[EN:AI_Innovator|AI Innovator - question a result list (LLM)]]
* [[EN:DetailView_AI|AI in the detail view - simplify reading patents]]
* [[EN:DetailView_AI|AI in the detail view - simplify reading patents]]
 
* [[EN:Highlighting#AI_-_Synonyms|AI and Highlighting - finding synonyms]]
* [[EN:Text_search#AI_-_Add_Synonyms|AI and text search - finding synonyms]]
* [[EN:Applicant#AI_-_Associated_Companies|AI and applicant search - finding associated companies]]
*[[EN:Result_List#AI|AI sorting of results]]
----
----
----
* [[EN:AI_Averbis|Averbis AI - folder classification]]
 
<pre style="color: red" >
The following functions are currently still in the pilot phase.
Individual functions, interfaces and scope may change in the future.
</pre>
 
The following AI functions are '''not''' available in Compass by default. <br/>
It is an additional module that can be activated by IP7 on request. <br/>
There are additional costs for the AI module. <br/>
The AI from [https://averbis.com Averbis] has been integrated into Compass. <br/>
 
The AI can (in future) be used for various areas in Compass: <br/>
* Folder assignment or automatic classification of patents
* Automatic sorting out of "not relevant" hits (available soon!)
* Sorting of the results list (coming soon!)
 
The main feature of an AI is that it is trained on the basis of predefined data. <br/>
The AI should improve or "learn" through constant human correction and renewed training. <br/>
AI is therefore always dependent on human performance. <br/>
The quality of the AI is therefore always dependent on this given data or on a human correcting the AI. <br/>
 
 
== Folder assignment ==
 
Folder structures are created in Compass to classify patents according to individual criteria. <br/>
For example, a technology tree can be created to assign patents to specific technologies. <br/>
With the "Automatic classification" function, the AI can take over this assignment task. <br
 
=== Classifier ===
 
[[File:AI_classifier.jpg|800px]]
 
Before the AI can assign patents to folders, the AI must learn or be trained based on existing assignments. <br/>
An existing folder structure with already correctly assigned patents is required.<br/>
 
A "classifier" can then be created.<br/>
Some settings are defined here, whereby many of the settings affect the training: <br/>
 
*Activate
Here you can specify whether the classifier can be used or whether regular automatic training takes place.<br/>
*Name
 
*Training frequency
This defines how often the classifier is trained automatically. <br/>
As the contents of the folders change over time, it makes sense to have the classifier trained regularly. <br/>
 
*Max. # Documents per folder
The maximum number of documents per folder that the AI can use for training. <br/>
Too few documents per folder will reduce the quality of the training. <br/>
Too many documents per folder and the training will take a lot of time. <br/>
<br/>
Ideally, the AI should always receive the same number of documents for each folder. <br/>
For example, if there are a few folders with approx. 500 assignments and many folders with approx. 50 assignments. <br/>
Then the value should be set to 50 so that training can be balanced and better results can be achieved. <br/>
The limit value here is a maximum of 1,000 documents. <br/>
If the number in a folder is higher, 1,000 documents are randomly selected from this folder. <br/>
 
*Min. # documents required for training
What is the minimum number of documents that must be in the folder for them to be used in training? <br/>
If there are only 3 documents in a folder, for example, the AI will not be able to draw any good conclusions from them. <br/>
A minimum value of 10 documents must be specified. <br/>
If the number in a folder is smaller, these folders/documents are not used. <br/>
 
*Min. confidence level
"Confidence Level" (in per cent) describes how confident the AI is with the assignment. <br/>
Everything below the minimum value ends up in the "unclassifiable" folder. <br/>
This is defined later in the automatic classification. <br/>
 
*Folder
Here you can define which folders are used for training. <br/>
The "automatic classification" will later make the assignments in these folders. <br/>
The AI can then also assign a patent or patent family to multiple folders.<br/>
A minimum of 3 folders/sub-folders have to be selected. <br/>
A maximum of 500 folders can be selected. <br/>
 
*Data
Here you define which data the AI receives for the training: <br/>
Title, Summary, Claims, Description, IPC, CPC <br/>
If Description is selected, the AI receives significantly more data for training. <br/>
This will also have a corresponding effect on the training time. <br/>
 
*Status
The current status of the classifier is displayed here: <br/>
 
"Training required" <br/>
After the classifier has been created, training is required. <br/>
 
"Training" <br/>
The classifier is currently being trained. <br/>
During training, automatic classifications that use this classifier cannot be executed. <br/>
 
"Ready" <br/>
The classifier is ready for use in an automatic classification. <br/>
This also means that the last training session was carried out successfully. <br/>
 
"Error" <br/>
An error has occurred. <br/>
Successful training is a prerequisite for use in automatic classification. <br/>
 
*Last training <br/>
This shows when the last (successful) training session was carried out. <br/>
 
The current status can be retrieved using the Refresh button: <br/>
[[File:AI_classifier_Status_refresh.jpg|500px]]
 
As soon as the classifier has finished training (status "Ready"), it can be used in an automatic classification. <br/>
 
==== Training statistics ====
Evaluation of the last training run. <br/>
[[File:AI_classifier_Status_analysis.jpg|500px]]
 
Further information on the terms Precision, F1Score and Recall: <br/>
https://en.wikipedia.org/wiki/Precision_and_recall
 
==== Limitations ====
In total (across all folders used in the classifier), a maximum of 10,000 patents are used for training. <br/>
If there are more than 10,000 patents in the folders, these are reduced proportionally. <br/>
If a folder is then below the minimum number of patents, the selected patents are increased to the minimum number. <br/>
This means that the limit is not exactly 10,000. <br/>
 
The classifier requires at least 3 folders/subfolders. <br/>
 
=== Automatic classification ===
 
[[File:AI_automatic_classification.jpg|800px]]
 
The following settings are available for automatic classification: <br/>
 
*Activate
Here you can specify whether classification is to be carried out automatically. <br/>
This option can be used to quickly stop an active automatic classification in the event of problems. <br/>
 
*Name
 
*Monitored folders
The "Incoming folders" are defined here. <br/>
All patents in these folders are then classified by the AI. <br/>
And, logically, all patents that are assigned to these folders in the future. <br/>
<br/>
Not to be confused with the folders of the classifier: <br/>
The folders into which the AI classifies/assigns the patents are defined in the classifier. <br/>
 
*Non-classifiable" folder
All patents that cannot be classified by the AI are assigned here. <br/>
 
*Classifier
The previously created classifier is selected here. <br/>
 
*Status
"idle" - automatic classification is currently not running. <br/>
"running" - automatic classification is currently running. <br/>
 
*Last checked on
The system regularly checks whether "new" patents are available for automatic classification. <br/>
If "new" patents are available, automatic classification is started. <br/>
The date indicates when a check was last carried out. <br/>
A check can also be triggered manually using the "Execute" button. <br/>
 
 
A classifier can theoretically be used for several automatic classifications. <br/>
An example of this: <br/>
 
There are several vehicle types that are monitored and should then be assigned to a technology tree: <br/>
"1, vehicle types" -> "bicycle" and "motorbike" <br/>
 
There is a folder structure or technology tree in which all vehicle types (vehicles with 2 wheels) are to be assigned: <br/>
"2, two wheel technologies" <br/>
 
However, the hits that the AI cannot classify should be saved separately. <br/>
For this reason, an automatic classification is created for "bicycle" and "motorbike". <br/>
In this case, however, the classifier only needs to be created once. <br/>
 
==== Limitations ====
A maximum of 5,000 hits/patents can be classified in one run. <br/>
 
If there are more than 5,000, automatic classification is not carried out and is deactivated. <br/>
If the run is triggered manually, a corresponding warning is displayed. <br/>
If the run is then started anyway, only up to 5,000 patents are classified. <br/>
 
Patents that have been classified in a run do not have to be removed from the input folder. <br/>
They are recognised as already classified in the next run and are not classified again. <br/>
To have one or more patents reclassified, they must be removed from the folder and then reassigned. <br/>
This reassignment means that they are no longer recognised as already classified. <br/>

Latest revision as of 10:40, 19 February 2026

Cookies help us deliver our services. By using our services, you agree to our use of cookies.