Oracle Text - using the wordlist to build a "Did you mean?" search feature

Hi,
I'm using 19c Enterprise Edition and APEX 21.2.
We have an internal knowledge-base application that is searchable using Oracle Text; the indexed column contains HTML text. Users have now asked for a "Did you mean: xyz?" feature for when they search for a word and the search returns no results.
My idea was to use the wordlist generated by Oracle Text (to be precise, the DR$xxx$I table), search that table using utl_match.edit_distance_similarity or utl_match.jaro_winkler_similarity, and return the best-matching row (1 row only) to the user.
Problem: the text index contains the words for more than one "client", and a user is only allowed to see specific articles/clients. So I have to relate the token_text from DR$xxx$I to the client somehow, so that a user is never offered words that occur only in articles of clients they have no access to.
- Does Oracle Text support adding custom values to the wordlist when the wordlist is created?
- Does Oracle Text already have any similar feature?
thanks & regards
Answers
-
Oracle Text has related features: fuzzy matching, stemming, and a thesaurus.
The fuzzy match works similarly to the utl_match.xxx_similarity functions.
-
Be careful with utl_match. It compares bytes, not characters, so two different two-byte characters that share one byte may be scored as half similar.
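To illustrate the byte-versus-character point, here is a minimal sketch in Python rather than PL/SQL. It assumes the byte-wise behavior described above and does not call utl_match; `edit_distance` and `similarity` are hypothetical helpers, and the 0 to 100 scaling is only an approximation of how a normalized edit-distance score might be computed.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (str or bytes)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Normalized score from 0 to 100 (hypothetical scaling, 100 = identical)."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 100
    return round(100 * (1 - edit_distance(a, b) / longest))


a, b = "\u00e4", "\u00f6"  # "ä" and "ö": one character each, two bytes each in UTF-8

char_score = similarity(a, b)                                  # compares characters
byte_score = similarity(a.encode("utf-8"), b.encode("utf-8"))  # compares bytes

print(char_score, byte_score)  # prints: 0 50
```

Compared as characters, the two letters have nothing in common. Compared as UTF-8 bytes, they share the lead byte 0xC3, so they come out as half similar.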
-
I wouldn't limit yourself to just 1 row, as you may have multiple matches with the same "score".
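A rough sketch of the overall idea, in Python rather than PL/SQL: score only the tokens the user is allowed to see and return every token tied for the best score. Everything here is an assumption for illustration. The `TOKENS` mapping stands in for a client-to-token relationship that would have to be built separately (it is not something DR$xxx$I provides by itself), the tokens are assumed to be stored in upper case, and `min_score` is an arbitrary cutoff.

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Normalized score from 0 to 100 (hypothetical scaling, 100 = identical)."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 100
    return round(100 * (1 - edit_distance(a, b) / longest))


def did_you_mean(term, tokens_by_client, allowed_clients, min_score=60):
    """Return all tokens tied for the best score, restricted to allowed clients."""
    candidates = set()
    for client in allowed_clients:
        candidates |= tokens_by_client.get(client, set())

    term = term.upper()  # assumption: tokens are stored in upper case
    scored = [(similarity(term, token), token) for token in candidates]
    if not scored:
        return []

    best = max(score for score, _ in scored)
    if best < min_score:
        return []  # nothing close enough to suggest
    return sorted(token for score, token in scored if score == best)


# Hypothetical token sets per client
TOKENS = {
    "client_a": {"TABLE", "CABLE", "INDEX"},
    "client_b": {"FABLED", "SECRET"},
}

print(did_you_mean("fable", TOKENS, ["client_a"]))              # ['CABLE', 'TABLE']
print(did_you_mean("fable", TOKENS, ["client_a", "client_b"]))  # ['FABLED']
```

With access to client_a only, two tokens tie at the same score and both are returned, and the closer token from client_b is never suggested.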