Skip to content

Commit c741a3a

Browse files
committed
Updated documentation for shiny
1 parent 97a4bcd commit c741a3a

File tree

2 files changed

+160
-33
lines changed

2 files changed

+160
-33
lines changed
Lines changed: 75 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,79 @@
1-
<img src="https://user-images.githubusercontent.com/89118428/155393065-780381a0-ff77-45d3-b2ee-40332ef72064.png" width="200" align="right"/>
1+
## About CiteSource
22

3-
CiteSource was developed in order to provide researchers the ability to examine the utility and efficacy of literature resources and search methodologies. The idea behind CiteSource is simply allowing users to deduplicate citation records, while maintaining customizable metadata about the citation.
3+
<img src="https://user-images.githubusercontent.com/89118428/155393065-780381a0-ff77-45d3-b2ee-40332ef72064.png" width="200" style="float: right; margin-left: 20px; margin-bottom: 10px;"/>
44

5-
Customizable metadata can include anything from a resource name (ex.Web of Science, LENS.org, PubMed), a method (database search, handsearching, citation chasing/ snowballing), a variation used within a method (WoS string #1, Wos string #2, WoS string #3), a research phase (search, TI/AB screening, Full-text Screening), or a unique group of citations (benchmarking articles, articles from a previous review, articles with a specific author affiliation).
5+
CiteSource is an R package and accompanying Shiny web application designed to support evidence data-driven decision-making during search strategy development. CiteSource also allows users to analyze and report on the impact of information sources and search methods.
66

7-
The CiteSource deduplication process is better described as record merging process due to the fact that the customizable metadata from duplicate records is maintained through the creation of a single, primary record. Beyond the merging of customizable metadata, the primary record is created by using the most complete metadata available between duplicate records (currently DOI and Abstract fields).
7+
CiteSource was developed as part of the [Evidence Synthesis Hackathon](https://www.eshackathon.org/) initiative.
88

9-
Once records are deduplicated, users are able to easily create plots and tables in order to answer specific questions or to simply explore the data to an effort to develop new hypotheses. Examples of analysis may include how many unique records a specific source contributed or how traditional methods of searching fare against a new AI discovery tool in discovering relevant articles. Users may want to understand the overlap in records between two different search strings or evaluate the impact of including Google Scholar in a review. Before searching, a user may even develop a targeted search to better understand the topical coverage across databases that they intend to search, and once the search has been developed, how particular source, string, or method performed in discovering benchmarking articles.
9+
---
10+
11+
### Key Features:
12+
13+
<details>
14+
<summary><strong>Flexible Metadata for Provenance Tracking:</strong></summary>
15+
16+
> * A core strength of CiteSource is its ability to assign and retain custom metadata to track the *provenance* of each citation – precisely where and how it was found. Users can tag records using three key fields:
17+
> * `cite_source`: Identify the origin database ('Web of Science', 'Scopus'), platform ('Lens.org'), or the specific search method used ('Citation Searching', 'String_1').
18+
> * `cite_label`: Track citations through screening phases using standardized terms: `search` (for initial results, benchmarks), `screened` (for records passing title/abstract review), and `final` (for records included in the synthesis after full-text review).
19+
> * `cite_string`: Add further detail, such as variations in search string syntax tested ('String_1a_truncation'), specific supplementary methods ('Handsearching_JournalX'), or other custom categories relevant to analysis.
20+
> * This detailed tagging enables rigorous analysis of the performance and contribution of each component of your overall search strategy.
21+
22+
</details>
23+
24+
<details>
25+
<summary><strong>Advanced Deduplication & Intelligent Merging:</strong></summary>
26+
27+
> * CiteSource employs the [`ASySD` (Automated Systematic Search Deduplicator) R package](https://github.com/camaradesuk/ASySD) to perform robust identification and merging of duplicate records.
28+
> * It conducts both *internal deduplication* (identifying duplicates within a single uploaded file/source, resulting in *distinct* records) and *external deduplication* (identifying duplicates across all uploaded files/sources, resulting in the set of *unique* citations).
29+
> * The process uses *intelligent merging*: custom metadata tags (`source`, `label`, `string`) from all identified duplicates are combined onto the primary record, preserving the full discovery history.
30+
> * The most complete bibliographic data (prioritizing DOI, Abstract) across duplicates is retained in the primary record.
31+
> * An optional *manual review* stage presents potential duplicates that fall below the automatic matching threshold, allowing users to confirm or reject merges for maximum accuracy.
32+
33+
</details>
34+
35+
<details>
36+
<summary><strong>Data-Driven Analysis & Visualization:</strong></summary>
37+
38+
> * Once deduplication is complete, CiteSource offers a suite of analysis and visualization tools designed specifically to speed up the *iterative process* of developing, testing, and validating search strategies:
39+
> * Visualize Overlap: Use interactive **Heatmaps** (pairwise overlap) and **Upset Plots** (multi-set intersections) to understand shared and unique records across sources, labels, or strings.
40+
> * Track Phase Progression: Employ the **Phase Analysis plot** (bar chart) to see the flow of unique and duplicate records through screening stages (`search` -> `screened` -> `final`).
41+
> * Generate Summary Tables: Access quantitative insights via automated tables detailing:
42+
> * Initial Record counts (showing the impact of internal deduplication).
43+
> * Record Summaries (detailing unique/overlapping records contributed by each source/method).
44+
> * Precision/Sensitivity calculations (evaluating source/method performance against the `final` included set).
45+
> * A detailed, interactive **Record Level Table** for quickly examining and linking to citations .
46+
47+
</details>
48+
49+
<details>
50+
<summary><strong>Enhanced Reporting & Transparent Export:</strong></summary>
51+
52+
> * CiteSource facilitates *transparent reporting* of search methods and results, aligning with guidelines like PRISMA.
53+
> * Export your final, deduplicated dataset in standard bibliographic formats (`.csv`, `.ris`, `.bib`).
54+
> * The custom metadata is embedded directly into standard fields within the export files (e.g., using C1, C2, C7, C8, DB fields in `.ris` format), providing a clear and reproducible audit trail for your methodology.
55+
56+
</details>
57+
58+
---
59+
60+
### Why use CiteSource for Evidence Synthesis?
61+
62+
CiteSource is built for anyone involved in evidence synthesis. It helps you:
63+
64+
* **Evaluate and optimize** information source selection based on unique record contributions.
65+
* **Refine and optimize** search strings by quickly testing variations.
66+
* **Analyze and report** the added value of different search methods, including supplementary searching techniques.
67+
* **Perform benchmark testing** to ensure key articles are captured by your strategy.
68+
* **Increase transparency and effectiveness** of your search strategy and processess through built-in tables for reporting.
69+
* **Save time** during the iterative search development.
70+
71+
### What Other Applications does CiteSource Serve ?
72+
73+
* Training in evidence synthesis search methods - MLIS classroom use for skill/knowledge development.
74+
* Methods research & development - large-scale methods testing, quick/live updates to analysis.
75+
* Library collection development - analyzing coverage of new databases compared to current subscriptions.
76+
77+
---
78+
79+
*CiteSource is available both as this interactive Shiny application and as a full R package with detailed vignettes. For more information on the R package, visit the [CiteSource Website](https://www.eshackathon.org/CiteSource/).*

0 commit comments

Comments
 (0)