The semantic search module represents the core functionality of the DaMSym tool.
It is designed to allow users to query multilingual textual corpora — Arabic, Church Slavonic, Greek, Latin, Sanskrit, and Latin & Greek — through an intelligent system based on conceptual similarity between query terms and the content present in the texts.
Unlike traditional search engines, which operate on literal matches, DaMSym processes the inserted terms by evaluating semantic proximity between concepts.
The result is a list of texts correlated not only by keyword, but also by meaning and linguistic context, thereby supporting comparative analysis and philological study.
Semantic search is available to all users, including unauthenticated users (Guest), while some advanced functions — such as adding resources, reviewing, or approving — require authenticated access.
The search engine interface is organized in a simple yet functional way.
All input tools are placed on the left side of the screen, arranged vertically and including:
The right side of the page is dedicated to displaying search results, which are updated in real time according to the selected settings.
The visual separation between input area and output area ensures clarity and immediacy, allowing users to modify parameters without reloading the page.
All search parameters are interdependent: any modification to filters, query terms, or concept weights dynamically influences the semantic context and the list of displayed results.
The system supports six main linguistic domains, each characterized by its own internal rules and metadata structure.
Despite internal differences, Arabic, Greek, Latin, and Latin & Greek share the same filter structure, while Church Slavonic and Sanskrit present different filter configurations.
| Language / Domain | Main Characteristics |
|---|---|
| Arabic, Greek, Latin, Latin & Greek | Common filter structure: Authors, Works, and Period. In the case of the Latin & Greek combination, the underlying search model simultaneously queries the Greek and Latin datasets. Consequently, the available metadata and filter structure are identical for both languages, and the returned results include texts from both corpora. |
| Church Slavonic | Includes a dedicated font selection dropdown to ensure correct character rendering. Provides filters for Language and Historical/Regional Variant. Does not include management of authors or works. |
| Sanskrit | Includes only the Works filter. No additional parameters are provided. Automatic transliteration is available to improve text readability. |
The Advanced Search functionality in DaMSym allows users to refine the precision of semantic search by combining the mandatory main query with one or two optional additional search phrases.
The main search always represents the core of the search process.
Additional phrases serve exclusively to refine, contextualize, or modulate the results, without ever replacing or outweighing the main search.
Each search block contributes to the final result through a numerical weight ranging from 0 to 1, representing its relative importance compared to the other active searches.
When the user activates the Advanced Search section:
These values represent an initial suggestion and can be modified by the user.
The user may choose to add a second additional phrase by clicking the “+” button.
In this case, a new section becomes available, and the user can assign three distinct weights to the three search terms or phrases, starting from the default values proposed by the system.
If both additional phrases are active, clicking the “–” button allows the user to remove the most recently added one.
The sum of the assigned weights must always be equal to 1. The system automatically recalculates values to maintain the semantic balance of the overall search configuration.
The semantic engine balances the search according to these proportions, returning results consistent with the indicated conceptual combination.
This weighting logic makes the search more flexible and suitable for comparative or multidisciplinary studies.
The weight assigned to each search phrase can be adjusted using a slider or through an input field, which provides arrow controls for changing the value.
In addition to semantic terms, users can narrow the scope of the search through a series of contextual filters, which vary depending on the selected language.
There is synchronization between the Authors and Works filters: when selecting an author, the Works filter is automatically updated to display only the works associated with that author.
Consequently, modifying the selected author dynamically updates the list of available works in the Works filter.
A Reset button is also available, allowing users to completely clear the search, including selected filters and text entered in the main search bar.
Search results are displayed as a list on the right side of the screen.
Each entry includes:
Results can be sorted according to three criteria:
In the results page, no direct text highlighting is applied for any of the supported languages.
By selecting the “More Details” button, the user accesses the detailed metadata view, where a contextualized portion of text is displayed — specifically, the segment semantically closest to the executed search query.
By clicking “More Details” next to a result, the user accesses a detailed view containing all information associated with the selected text:
From this same view, authenticated users (Researcher, Reviewer, and WP Lead) may propose corrections directly on the highlighted text, modify metadata using the Edit button (which appears next to each metadata field when hovering over it), and add new metadata using the Add Metadata button located on the right, immediately below the metadata list.
Within the More Details view, users with the roles of Researcher, Reviewer, and WP Lead can interact with the displayed text and metadata according to controlled procedures.
In particular:
All modifications are not applied directly but are saved as correction proposals.
Proposals are visible in the Corrections section of the Dashboard, where:
Search and filters in DaMSym do not operate independently but in a relationship of dependency: any modification to parameters influences the semantic processing of the query.
For example, selecting a specific author automatically restricts the semantic context to the subset of texts associated with that author.
This dynamic architecture ensures a fluid, coherent, and scientifically accurate search experience, suitable for comparative studies and high-level linguistic analysis.