For the best experience, this project uses the Webflow Input Enhancer extension. We highly recommend installing it. Click here to download (use preview mode to access link)
A powerful new foundation for custom queries—built on Lucene and designed for R&D precision.
Over the past few years, Cypris has helped innovation teams make faster, more informed decisions by centralizing critical insights across datasets like patents, academic papers, and company activity. But until now, our search experience relied on a legacy query system with limited capabilities, offering little support for advanced search features or dataset-level customization.
Today, we’re excited to introduce an upgraded Advanced Search on Cypris, a complete overhaul of our query engine and search experience, powered by the open-standard Lucene query syntax. This update introduces a more robust and flexible search foundation, unlocking new ways to query data, build complex filters, and extract precisely what you need across patents, research, and more.
Why we rebuilt our search system from the ground up
Cypris’ original query syntax, a proprietary format used internally for years, limited users’ ability to craft advanced queries or tailor searches to specific datasets. It lacked modern capabilities like proximity searches, field-level customization, or true Boolean logic. This made it difficult to build a reliable and intuitive experience for both casual users and advanced researchers.
By moving to Lucene, we’re adopting a powerful, industry-standard query language that makes it easier for developers to build advanced features—and gives users access to a far more capable and flexible search toolset.
What’s new in Advanced Search
1. Custom Queries by Dataset You can now layer queries to search across datasets or tailor filters to each one. For example, you can run a broad query on drone delivery, and then add separate layers to focus on patents by a specific assignee and papers from a specific country or funding agency.
Navigating the All Datasets tab introduces a new level of complexity—and power—by allowing users to apply dataset-specific logic within a single, unified query workflow. While querying multiple datasets simultaneously might seem straightforward, the underlying differences in schema, metadata, and available fields between our proprietary datasets make this a deeply technical challenge. Patents, for example, include claims, application numbers, and multiple date fields (filed, granted, updated), while academic papers use DOIs, have different structural conventions, and emphasize different metadata. In the past, we sidestepped this complexity by translating general queries like ((drone_allText)) into dataset-specific logic under the hood. Now, instead of obscuring that logic, we allow users to opt in to it. The builder provides progressive layers of customization: start with intuitive keyword searches across all fields, then move into the advanced builder for field-specific targeting, fuzzy logic, and term boosting, and finally, tailor query logic by dataset—such as specifying different countries of interest for papers vs. patents. This approach preserves flexibility while giving users full control, and with tools like our real-time Live Analysis and “Your Query” panel, we make it easy to understand how every decision affects the results.
2. More Fields to Query We’re exposing deeper fields across datasets—giving you explicit control over the dimensions of your search. For the first time, users can now search academic papers by DOI, a critical identifier previously unsupported on the platform. You can also query by:
- Author or inventor names
- Organizations or assignees
- Countries, journals, funding agencies, and more
3. Full Boolean Support Advanced Search now leverages powerful Boolean logic—AND, OR, NOT, and grouping—enabling more precise control over search logic and improving performance and accuracy.
4. Lucene Syntax Features Use built-in Lucene features to create expressive, complex searches:
- Proximity searches to find terms near each other
- Fuzzy searches for flexible matching
- Exact phrase matching
- Boosting to prioritize results (e.g., prioritize results mentioning AI 3x more than others)
- Prefix/Postfix queries to match phrases that start or end a certain way
- Range queries for fields like date, funding amounts, or numerical values
A more powerful user experience
Our new search interface is built to help you tap into these capabilities without needing to know the syntax from the start. You’ll find:
- A Query Builder to guide you through complex searches
- A Help Video to onboard users to Lucene-style searches
- Inline examples and tips for writing queries using grouping, boosting, and more
Built for precision, speed, and customization
With Lucene as our foundation, search results are now not only more flexible but also faster and more accurate. Semantic search continues to offer natural-language ease of use, while Boolean search gives power users the performance and structure they need to uncover insights with greater specificity.
Whether you’re an innovation analyst drilling into AI patents or a business development lead scanning academic papers from Chilean researchers—Advanced Search is built to help you get to the signal, faster.
Available now to all users
Advanced Search is live and available across the Cypris platform today. If you’re already using Cypris, you’ll find the new search interface in your dashboard, complete with updated syntax documentation and walkthroughs.
We’re excited to see what you’ll build, discover, and analyze with this new capability. This is just the beginning—we’ll continue expanding the fields, syntax features, and customization options as we push the boundaries of what intelligent search can do for R&D.
Subscribe to receive the latest blog posts to your inbox every week.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
A powerful new foundation for custom queries—built on Lucene and designed for R&D precision.
Over the past few years, Cypris has helped innovation teams make faster, more informed decisions by centralizing critical insights across datasets like patents, academic papers, and company activity. But until now, our search experience relied on a legacy query system with limited capabilities, offering little support for advanced search features or dataset-level customization.
Today, we’re excited to introduce an upgraded Advanced Search on Cypris, a complete overhaul of our query engine and search experience, powered by the open-standard Lucene query syntax. This update introduces a more robust and flexible search foundation, unlocking new ways to query data, build complex filters, and extract precisely what you need across patents, research, and more.
Why we rebuilt our search system from the ground up
Cypris’ original query syntax, a proprietary format used internally for years, limited users’ ability to craft advanced queries or tailor searches to specific datasets. It lacked modern capabilities like proximity searches, field-level customization, or true Boolean logic. This made it difficult to build a reliable and intuitive experience for both casual users and advanced researchers.
By moving to Lucene, we’re adopting a powerful, industry-standard query language that makes it easier for developers to build advanced features—and gives users access to a far more capable and flexible search toolset.
What’s new in Advanced Search
1. Custom Queries by Dataset You can now layer queries to search across datasets or tailor filters to each one. For example, you can run a broad query on drone delivery, and then add separate layers to focus on patents by a specific assignee and papers from a specific country or funding agency.
Navigating the All Datasets tab introduces a new level of complexity—and power—by allowing users to apply dataset-specific logic within a single, unified query workflow. While querying multiple datasets simultaneously might seem straightforward, the underlying differences in schema, metadata, and available fields between our proprietary datasets make this a deeply technical challenge. Patents, for example, include claims, application numbers, and multiple date fields (filed, granted, updated), while academic papers use DOIs, have different structural conventions, and emphasize different metadata. In the past, we sidestepped this complexity by translating general queries like ((drone_allText)) into dataset-specific logic under the hood. Now, instead of obscuring that logic, we allow users to opt in to it. The builder provides progressive layers of customization: start with intuitive keyword searches across all fields, then move into the advanced builder for field-specific targeting, fuzzy logic, and term boosting, and finally, tailor query logic by dataset—such as specifying different countries of interest for papers vs. patents. This approach preserves flexibility while giving users full control, and with tools like our real-time Live Analysis and “Your Query” panel, we make it easy to understand how every decision affects the results.
2. More Fields to Query We’re exposing deeper fields across datasets—giving you explicit control over the dimensions of your search. For the first time, users can now search academic papers by DOI, a critical identifier previously unsupported on the platform. You can also query by:
- Author or inventor names
- Organizations or assignees
- Countries, journals, funding agencies, and more
3. Full Boolean Support Advanced Search now leverages powerful Boolean logic—AND, OR, NOT, and grouping—enabling more precise control over search logic and improving performance and accuracy.
4. Lucene Syntax Features Use built-in Lucene features to create expressive, complex searches:
- Proximity searches to find terms near each other
- Fuzzy searches for flexible matching
- Exact phrase matching
- Boosting to prioritize results (e.g., prioritize results mentioning AI 3x more than others)
- Prefix/Postfix queries to match phrases that start or end a certain way
- Range queries for fields like date, funding amounts, or numerical values
A more powerful user experience
Our new search interface is built to help you tap into these capabilities without needing to know the syntax from the start. You’ll find:
- A Query Builder to guide you through complex searches
- A Help Video to onboard users to Lucene-style searches
- Inline examples and tips for writing queries using grouping, boosting, and more
Built for precision, speed, and customization
With Lucene as our foundation, search results are now not only more flexible but also faster and more accurate. Semantic search continues to offer natural-language ease of use, while Boolean search gives power users the performance and structure they need to uncover insights with greater specificity.
Whether you’re an innovation analyst drilling into AI patents or a business development lead scanning academic papers from Chilean researchers—Advanced Search is built to help you get to the signal, faster.
Available now to all users
Advanced Search is live and available across the Cypris platform today. If you’re already using Cypris, you’ll find the new search interface in your dashboard, complete with updated syntax documentation and walkthroughs.
We’re excited to see what you’ll build, discover, and analyze with this new capability. This is just the beginning—we’ll continue expanding the fields, syntax features, and customization options as we push the boundaries of what intelligent search can do for R&D.
Keep Reading
•
XX
min read
Is Google Scholar good for research? This question is often raised by researchers and professionals in various fields. In this blog post, we will examine the benefits and drawbacks of Google Scholar to determine its appropriateness for your research requirements.
We will discuss the extensive coverage provided by Google Scholar, its ranking system for relevance in comparison with other databases such as Scopus and Web of Science, and the citation tracking functionality offered by Google Scholar.
To conclude our analysis on “Is Google Scholar good for research?”, we’ll highlight the importance of complementing it with specialized databases like PubMed or IEEE Xplore for specific disciplines or combining it with Scopus or Web of Science for advanced search capabilities.
Yes, Google Scholar is a valuable resource for research as it offers extensive coverage of scholarly literature, including conference papers, books, preprints, and journal articles. Its ranking system helps in identifying relevant resources while the citation tracking functionality aids in analyzing impact factors.
Extensive Coverage of Google Scholar
Google Scholar offers a vast range of scholarly literature, indexing over 160 million documents from various sources such as conference papers, books, preprints, and journal articles. Google Scholar provides a convenient way to access an extensive range of scholarly material, eliminating the need for users to search through multiple websites or databases.
Conference Papers Indexed in Google Scholar
The platform includes an extensive collection of conference papers from numerous disciplines. By accessing these resources through Google Scholar, researchers can stay up-to-date with the latest findings presented at conferences around the world.
Books Available Through the Search Engine
In addition to academic articles and conference proceedings, Google Scholar also indexes books published by reputable publishers. Researchers can use this feature to locate essential reference materials for their projects and gain insights into previous studies conducted within their field.
Preprints and Journal Articles Accessible via the Platform
Preprints: These are preliminary versions of research papers that have not yet been peer-reviewed but are made available online for feedback from other experts in the field. By including preprint repositories like arXiv.org or bioRxiv.org in its search results, Google Scholar helps researchers discover cutting-edge work before it is formally published.
Journal Articles: As one would expect, a significant portion of indexed content on Google Scholar consists of peer-reviewed journal articles across various fields. The platform’s comprehensive coverage ensures that users can access high-quality research material efficiently while conducting searches using keywords related to their area of interest.
For those asking “is google scholar good for research”, Google Scholar is an excellent tool for researchers looking to find relevant and reliable sources quickly. Its extensive coverage of various types of scholarly literature, including conference papers, books, preprints, and journal articles, makes it a valuable resource for anyone conducting research.
Google Scholar employs a sophisticated algorithm to rank search results based on their relevance, taking into account factors such as the author’s citation count and publication history. This ranking system has been found to provide better precision than other multidisciplinary databases like Scopus or Web of Science, particularly when searching for specific topics within respective fields.
A study by Martin-Martin et al. demonstrated that Google Scholar outperforms these alternatives in terms of precision and coverage.
Factors Considered in Ranking Search Results
Citation count: The number of times an article has been cited by others is used as an indicator of its importance and impact within the field.
Publication history: Articles published in well-established journals with high impact factors are more likely to be ranked higher, reflecting their perceived quality and credibility.
Affiliation: The reputation of the authors’ institutions can also influence rankings, with prestigious universities often being associated with higher-quality research output.
Comparison with Scopus and Web of Science
In comparison to Google Scholar, both Scopus and Web of Science offer advanced search capabilities allowing users greater control over filtering options; however, they may not always deliver superior results due to limitations in their indexing scope or potential biases towards certain disciplines or sources.
Google Scholar’s ranking system for relevance provides an effective way to identify the most relevant and impactful research, allowing R&D teams to quickly gain insights into their topics of interest making it the option to choose when asking “is google scholar good for research”. Moving on, citation tracking functionality through Google Scholar can provide further insight into the impact factor of a particular piece of research.
When asking “is google scholar good for research”, one key feature that makes it suitable for research purposes is its citation-tracking functionality. Researchers can easily track citations received by their work or others, helping them stay informed about recent developments in their field while also providing valuable insight into the impact factor of publications they are interested in citing themselves.
Benefits of Tracking Citations Using Google Scholar
Ease of use: With a simple interface, researchers can quickly access information on how many times an article has been cited and view the list of citing articles.
Breadth of coverage: Google Scholar’s extensive database ensures that users have access to a wide range of citation data from various sources such as conference papers, books, preprints, and journal articles.
Analyzing trends: By monitoring citation patterns over time, researchers can identify emerging trends within their field and assess the significance or relevance of specific topics.
Impact Factor Analysis Through Citation Data
The number of citations an article receives is often used as an indicator of its impact within a particular discipline. While this metric has limitations – such as potential biases towards older publications with more time to accumulate citations – it still provides useful insights when comparing different resources during literature reviews or grant applications.
By utilizing Google Scholar’s search results alongside other databases like Scopus or Web of Science, R&D managers, and engineers can make better-informed decisions regarding which publications hold greater weight within their respective fields. Citation tracking functionality is a powerful tool for R&D and innovation teams, allowing them to quickly access the literature they need while understanding its impact.
Despite its benefits, there are limitations associated with using Google Scholar exclusively for conducting research. Some of the key challenges include a lack of quality control, incomplete metadata records, limited advanced search options compared to other databases, inconsistencies in coverage regarding specific disciplines or journals, and a lack of transparency on the methodology behind content indexing and result rankings.
Quality Control Concerns with Unfiltered Resources
Google Scholar’s unfiltered approach may lead to the inclusion of low-quality resources such as predatory journals or self-published articles that have not undergone rigorous peer-review processes. This makes it crucial for researchers to verify the credibility of sources before citing them in their work.
Incomplete Metadata Affecting Resource Selection Process
The incomplete metadata records retrieved through Google Scholar often lack essential bibliographic details, including abstracts, which can make it difficult for users to assess the relevance of a resource without having to visit each individual source website.
Limited advanced search options available in Google Scholar, when compared with specialized databases like Scopus or Web of Science, restrict researchers from carrying out comprehensive literature reviews by narrowing down results based on specific criteria such as publication date range or document type.
Inconsistency in Indexing Affecting Representation of Available Literature
Google Scholar’s coverage of specific disciplines, journals, or individual articles can be inconsistent, which may lead to gaps in the available literature and hinder researchers from obtaining a complete understanding of their research topic.
Lack of Transparency on Google Scholar’s Methodology
The obscurity of Google Scholar’s indexing and rating process renders it difficult for people to comprehend how search outcomes are produced, potentially producing imbalances in the depiction of scholarly material within its database.
Despite its limitations and challenges, Google Scholar remains a valuable tool for research teams. However, it is important to supplement the platform with specialized databases in order to maximize search capabilities.
Key Takeaway:
Using Google Scholar exclusively for research has limitations such as a lack of quality control, incomplete metadata records, limited advanced search options compared to other databases, inconsistencies in coverage regarding specific disciplines or journals, and a lack of transparency on the methodology behind content indexing and result rankings. Researchers should verify sources before citing them in their work due to concerns with unfiltered resources that may include low-quality materials like predatory journals or self-published articles without rigorous peer-review processes.
Complementing Google Scholar with Specialized Databases
Is google scholar good for research? Yes, but complementing it with specialized databases makes it even better. To ensure access to high-quality information relevant to their field and carry out comprehensive searches without missing important publications, researchers should use specialized databases alongside Google Scholar.
By using multiple sources together, R&D managers, engineers, scientists, and innovation teams can leverage the strengths offered by each database while mitigating potential drawbacks associated with any single source.
Importance of Using PubMed or IEEE Xplore for Specific Disciplines
In addition to Google Scholar’s extensive coverage, it is crucial for researchers in specific disciplines such as life sciences or engineering to utilize specialized databases like PubMed or IEEE Xplore, respectively. These platforms offer more targeted search results and provide access to unique resources not available on Google Scholar.
For instance, PubMed includes biomedical literature from MEDLINE while IEEE Xplore houses a vast collection of technical papers related to electrical engineering and computer science.
Combining Scopus or Web of Science for Advanced Search Capabilities
Scopus and Web of Science, two multidisciplinary research databases that are often compared with Google Scholar due to their wide-ranging content coverage, offer advanced search capabilities that may be lacking in the latter platform. Some benefits include better filtering options, more comprehensive citation analysis, and higher-quality metadata.
Incorporating specialized databases like PubMed or IEEE Xplore along with multidisciplinary platforms such as Scopus or Web of Science can significantly enhance the efficiency and effectiveness of research efforts when used in conjunction with Google Scholar. Researchers can leverage the strengths of each database to obtain a more comprehensive view of the research landscape and make informed decisions based on the search results.
Key Takeaway:
To conduct comprehensive research, R&D teams should complement Google Scholar with specialized databases like PubMed or IEEE Xplore for specific disciplines and Scopus or Web of Science for advanced search capabilities. By using multiple sources together, researchers can leverage the strengths offered by each database while mitigating potential drawbacks associated with any single source to obtain a more comprehensive view of the research landscape.
Conclusion
So overall, is Google Scholar good for research? Yes, Google Scholar offers a user-friendly interface with extensive coverage of scholarly literature, a ranking system for relevance, and citation-tracking functionality. There are limitations associated with using Google Scholar exclusively for conducting research, however, you can counter this by complementing it with specialized databases to ensure high-quality and comprehensive searches.
If you’re looking for more ways to improve your R&D process or need help navigating available resources like Google Scholar effectively, contact Cypris and unlock your team’s potential! Our platform provides rapid time-to-insights, centralizing data sources for improved R&D and innovation team performance.
Is Google Scholar Good for Research? Exploring Pros & Cons
Blogs
•
XX
min read
A faster, more accurate way to explore innovation data—now available in Cypris.
For innovation teams, speed and accuracy aren’t optional—they’re critical. You need to quickly find all relevant documents, slice and dice datasets however you want, and trust that the results are complete and representative. With this in mind, we’ve upgraded how semantic search works inside Cypris.
Today, we’re launching an upgraded search infrastructure that gives users access to full, exact result sets—unlocking more powerful analysis, faster iteration, and deterministic filtering and charting.
Unlike traditional semantic or vector search engines—which make it difficult to count, filter, or chart large sets of matched documents—our new approach prioritizes transparency and performance while preserving semantic relevance.
Why we moved away from vector search
Our original implementation relied on semantic and vector search to capture the “meaning” behind user queries. But as our platform evolved, it became clear that these systems weren’t well-suited for our core use cases.
Users needed:
Deterministic filtering (e.g., "how many results match this atom?")
Transparent, complete result sets to power charts and dashboards
Fast, repeatable queries that don’t change subtly over time
Modern vector search systems don’t easily support this level of transparency. They return approximate matches and abstract similarity scores, often making it hard to understand why a document was returned—or whether it’s the full picture.
So we made a decision: move away from vector search and lean into what traditional search engines do best.
A return to boolean and lexical search—with a twist
We rebuilt our search infrastructure on top of Elasticsearch’s powerful boolean and lexical search capabilities. This shift brings major advantages:
Faster query speeds that dramatically improve iteration time
Deterministic filtering and counts, so every chart is grounded in the full dataset
Predictable, explainable results that users can trust
But we didn’t stop there.
To preserve the benefits of semantic understanding, we’ve rethought where that intelligence should live—not at query time, but at data ingestion.
Capturing semantic meaning at ingest time
Instead of computing document-query similarity during search, we enrich documents at the time of ingestion. Here’s how:
Synonym expansion: We find related words and concepts not explicitly mentioned in the document and add them as fields, enabling semantic-style recall via lexical search.
Stemming: Both queries and documents are reduced to their root forms, allowing consistent matches (e.g., “running” and “run”).
The result? You get the same functionality—semantically relevant results—without the opacity or latency tradeoffs of vector search.
What’s next: Reranking for even better relevance
We’re not done. Coming soon to Cypris is a reranking layer that boosts the most relevant results to the top of the list using lightweight vector techniques.
Here’s how it works:
A standard lexical search retrieves the full result set.
We take the top N results and rerank them using vector similarity, powered by Elasticsearch’s new hybrid scoring capabilities.
You get faster queries with even better relevance—without compromising on counts or transparency.
This layered approach gives us the best of both worlds: precise filtering and fast queries, plus smarter ordering of results where it matters most.
We’re excited to bring this upgrade to our users, and we’re already seeing teams iterate faster and uncover insights more confidently. This is a foundational shift—and just the beginning of what’s to come.
Want a walkthrough of what’s changed? Reach out to our team.