Class NameCentricRetrieval

    • Field Detail

      • CONJUNCTION

        public static final de.julielab.geneexpbase.candidateretrieval.QueryGenerator CONJUNCTION
      • DISJUNCTION

        public static final de.julielab.geneexpbase.candidateretrieval.QueryGenerator DISJUNCTION
      • DISJUNCTION_MINUS_1

        public static final de.julielab.geneexpbase.candidateretrieval.QueryGenerator DISJUNCTION_MINUS_1
      • DISJUNCTION_MINUS_2

        public static final de.julielab.geneexpbase.candidateretrieval.QueryGenerator DISJUNCTION_MINUS_2
      • NGRAM_2_3

        public static final de.julielab.geneexpbase.candidateretrieval.QueryGenerator NGRAM_2_3
      • TOKEN_JAROWINKLER_SCORER

        public static final int TOKEN_JAROWINKLER_SCORER
        See Also:
        Constant Field Values
      • candidateLog

        public static final org.slf4j.Logger candidateLog
      • LUCENE_MAX_HITS

        public static final int LUCENE_MAX_HITS
        The maximum number of hits Lucene returns for a query.
        See Also:
        Constant Field Values
    • Constructor Detail

      • NameCentricRetrieval

        @Inject
        public NameCentricRetrieval​(Configuration config,
                                    de.julielab.geneexpbase.services.CacheService cacheService)
                             throws de.julielab.geneexpbase.candidateretrieval.GeneCandidateRetrievalException
        Throws:
        de.julielab.geneexpbase.candidateretrieval.GeneCandidateRetrievalException
    • Method Detail

      • getTotalCacheGettime

        public static AtomicLong getTotalCacheGettime()
      • getTotalCachePuttime

        public static AtomicLong getTotalCachePuttime()
      • getTotalLuceneQueryTime

        public static AtomicLong getTotalLuceneQueryTime()
      • shutdownExecutor

        public static void shutdownExecutor()
      • getNormalizer

        public de.julielab.geneexpbase.TermNormalizer getNormalizer()
      • setNormalizer

        public void setNormalizer​(de.julielab.geneexpbase.TermNormalizer normalizer)
      • getScorer

        public de.julielab.geneexpbase.scoring.Scorer getScorer()
      • setScorerType

        public de.julielab.geneexpbase.scoring.Scorer setScorerType​(int type)
                                                             throws de.julielab.geneexpbase.candidateretrieval.GeneCandidateRetrievalException
        Throws:
        de.julielab.geneexpbase.candidateretrieval.GeneCandidateRetrievalException
      • getScorerInfo

        public String getScorerInfo()
      • getScorerType

        public int getScorerType()
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(String originalSearchTerm,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Specified by:
        getCandidates in interface CandidateRetrieval
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(de.julielab.geneexpbase.genemodel.GeneMention geneMention,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Specified by:
        getCandidates in interface de.julielab.geneexpbase.candidateretrieval.CandidateRetrieval
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(de.julielab.geneexpbase.genemodel.GeneMention geneMention,
                                                                                     Collection<String> organisms,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Specified by:
        getCandidates in interface de.julielab.geneexpbase.candidateretrieval.CandidateRetrieval
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(de.julielab.geneexpbase.genemodel.GeneMention geneMention,
                                                                                     Collection<String> geneIdsFilter,
                                                                                     Collection<String> organisms,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Specified by:
        getCandidates in interface de.julielab.geneexpbase.candidateretrieval.CandidateRetrieval
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(de.julielab.geneexpbase.genemodel.GeneMention geneMention,
                                                                                     Collection<String> geneIdsFilter,
                                                                                     Collection<String> organisms,
                                                                                     boolean loadFields,
                                                                                     de.julielab.geneexpbase.configuration.Parameters parameters,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Specified by:
        getCandidates in interface CandidateRetrieval
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(de.julielab.geneexpbase.genemodel.GeneMention geneMention,
                                                                                     Collection<String> geneIdsFilter,
                                                                                     Collection<String> organisms,
                                                                                     boolean loadFields,
                                                                                     de.julielab.geneexpbase.configuration.Parameters parameters,
                                                                                     int numReturnedHits,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(String geneMentionText,
                                                                                     Collection<String> geneIdsFilter,
                                                                                     Collection<String> organism,
                                                                                     boolean loadFields,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(de.julielab.geneexpbase.genemodel.GeneMention geneMention,
                                                                                     String organism,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Specified by:
        getCandidates in interface CandidateRetrieval
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(String geneMentionText,
                                                                                     String organism,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Specified by:
        getCandidates in interface CandidateRetrieval
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(String geneMentionText,
                                                                                     Collection<String> organism,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Specified by:
        getCandidates in interface CandidateRetrieval
      • mapGeneIdToTaxId

        public String mapGeneIdToTaxId​(String geneId)
        Specified by:
        mapGeneIdToTaxId in interface de.julielab.geneexpbase.candidateretrieval.CandidateRetrieval
      • scoreIdsByNGramSynonyms

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> scoreIdsByNGramSynonyms​(String synonymsString,
                                                                                               Set<String> geneIds)
      • getCandidates

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getCandidates​(de.julielab.geneexpbase.genemodel.GeneMention gm,
                                                                                     Collection<String> taxId,
                                                                                     de.julielab.geneexpbase.configuration.Parameters parameters,
                                                                                     de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Specified by:
        getCandidates in interface CandidateRetrieval
      • scoreSynonymsRecordIndex

        public org.apache.commons.lang3.tuple.Pair<Map<String,​Double>,​Map<String,​List<String>>> scoreSynonymsRecordIndex​(Collection<de.julielab.geneexpbase.genemodel.GeneName> allSynonyms,
                                                                                                                                           Set<String> ids,
                                                                                                                                           Function<GeneRecordHit,​String[]> synhit2namesFunc,
                                                                                                                                           de.julielab.geneexpbase.candidateretrieval.QueryGenerator qg)

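        Scores each synonym in allSynonyms against the IDs in ids.

        Each resulting SynHit adds its mention score to the score of the ID it represents.

        Parameters:
        allSynonyms - The synonyms to score.
        ids - The IDs to score the synonyms against.
        synhit2namesFunc - A function that maps a GeneRecordHit to an array of names.
        qg - The query generator to use.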
        Returns:
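
        The per-ID accumulation described for scoreSynonymsRecordIndex can be sketched as follows. This is a minimal, self-contained illustration and not the library's implementation: the Hit class is a hypothetical stand-in for SynHit, sumScoresById is a hypothetical helper, and only an ID-to-summed-score map is modeled. The second map of the returned Pair is not covered here.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class ScoreAggregationSketch {

    /** Hypothetical stand-in for SynHit: the ID a hit represents and its mention score. */
    static final class Hit {
        final String id;
        final double mentionScore;

        Hit(String id, double mentionScore) {
            this.id = id;
            this.mentionScore = mentionScore;
        }
    }

    /**
     * Adds each hit's mention score to the running total of the ID it represents.
     * Hits whose ID is not contained in ids are ignored (an assumption of this sketch).
     */
    static Map<String, Double> sumScoresById(List<Hit> hits, Set<String> ids) {
        // LinkedHashMap keeps the IDs in first-seen order for readable output.
        Map<String, Double> scoreById = new LinkedHashMap<>();
        for (Hit hit : hits) {
            if (ids.contains(hit.id)) {
                scoreById.merge(hit.id, hit.mentionScore, Double::sum);
            }
        }
        return scoreById;
    }

    public static void main(String[] args) {
        List<Hit> hits = List.of(
                new Hit("id1", 0.5),
                new Hit("id2", 0.25),
                new Hit("id1", 0.25),
                new Hit("id3", 1.0)); // dropped: id3 is not among the requested IDs
        System.out.println(sumScoresById(hits, Set.of("id1", "id2")));
        // prints {id1=0.75, id2=0.25}
    }
}
```

        Map.merge keeps the accumulation to one call per hit. Whether the actual method restricts hits to ids in this way, or normalizes the summed scores afterwards, is not stated in this documentation.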
      • scoreIdsByExactSynonyms

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> scoreIdsByExactSynonyms​(Collection<String> allSynonyms,
                                                                                               Set<String> geneIds)
      • getPriorityNames

        public List<String> getPriorityNames​(String id,
                                             int priority)
      • getFamilyNames

        public List<de.julielab.geneexpbase.candidateretrieval.SynHit> getFamilyNames​(de.julielab.geneexpbase.genemodel.GeneMention gm,
                                                                                      de.julielab.geneexpbase.candidateretrieval.QueryGenerator queryGenerator)
        Description copied from interface: CandidateRetrieval
        Searches the index for the given gene mention filtered for family names.
        Specified by:
        getFamilyNames in interface CandidateRetrieval
        Parameters:
        gm - The gene mention to check for family names.
        queryGenerator - The query generator to use.
        Returns: