How the Sanskrit Unicode Text Search Works

Our Sanskrit search engine is optimized to find free, downloadable Sanskrit texts encoded in Unicode. Sites known to host high-quality text archives are given priority in search results. The prioritization algorithm uses a list of sites that have been checked for both content and technical setup. Other sites may also appear in search results, but they receive a lower ranking.
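As a rough illustration of the ranking idea described above, the following Python sketch places results from a priority list ahead of all other results. It is not the engine's actual code: the host names, the PRIORITY_HOSTS set and the rank_results function are hypothetical, and the sketch assumes a simple two-tier ordering.

```python
from urllib.parse import urlparse

# Hypothetical priority list; the real list is maintained by the search service.
PRIORITY_HOSTS = {"archive-one.example", "archive-two.example"}


def rank_results(urls, priority_hosts=PRIORITY_HOSTS):
    """Return urls with priority-site results first.

    sorted() is stable, so the original order is preserved
    within the priority group and within the non-priority group.
    """
    def tier(url):
        host = urlparse(url).hostname or ""
        return 0 if host in priority_hosts else 1

    return sorted(urls, key=tier)


results = rank_results([
    "http://other.example/a",
    "http://archive-one.example/text",
    "http://other.example/b",
    "http://archive-two.example/x",
])
```

Using a stable sort keeps whatever relevance ordering the underlying search already produced, and only lifts the checked sites above the rest.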

To be included in the priority list, a site must provide Unicode versions of Sanskrit texts for free public download. Sites whose collections use other encodings may be included if they contain at least some Unicode texts, or if their repositories follow consistent, clearly documented encoding standards that allow researchers to convert the texts to Unicode. Sites that offer search services but do not let the researcher download the full source text are not given priority status. Sites that require user registration are excluded from search results altogether (blacklisted).
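The criteria above can be summarized as a small decision function. The Python sketch below is only an approximation under stated assumptions: the Site fields and the classify function are hypothetical names, and in practice sites are checked by hand, so "may be included" is a judgment call rather than an automatic rule.

```python
from dataclasses import dataclass


@dataclass
class Site:
    # All fields are hypothetical labels for the criteria in the text.
    offers_free_download: bool       # full source texts can be downloaded freely
    has_unicode_texts: bool          # at least some texts are in Unicode
    has_documented_encoding: bool    # consistent, documented non-Unicode encoding
    requires_registration: bool      # user registration is required


def classify(site):
    """Return 'excluded', 'priority' or 'normal' for a site."""
    if site.requires_registration:
        return "excluded"            # blacklisted from search results
    if not site.offers_free_download:
        return "normal"              # e.g. search-only services
    if site.has_unicode_texts or site.has_documented_encoding:
        return "priority"            # eligible for the priority list
    return "normal"


search_only = Site(False, True, False, False)
convertible = Site(True, False, True, False)
registered = Site(True, True, True, True)
```

Here classify(search_only) gives "normal", classify(convertible) gives "priority", and classify(registered) gives "excluded", because the registration check is applied before any other criterion.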

As of 17 February 2011 the list of priority archival sites includes, in no particular order: