[go: up one dir, main page]

Skip to content
View soldni's full-sized avatar
🏳️‍🌈
vibing!
🏳️‍🌈
vibing!

Organizations

@Georgetown-IR-Lab @allenai

Block or report soldni

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. allenai/dolma allenai/dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1k 110

  2. allenai/smashed allenai/smashed Public

    SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata …

    Python 31 3

  3. springs springs Public

    A set of utilities to turn Dataclasses into useful configuration managers.

    Python 11 2

  4. Georgetown-IR-Lab/QuickUMLS Georgetown-IR-Lab/QuickUMLS Public

    System for Medical Concept Extraction and Linking

    Python 382 95

  5. trouting trouting Public

    Type Routing (trouting) is a decorator that selects the right method in a class based on the input data type

    Python 2

  6. pyterrier_sentence_transformers pyterrier_sentence_transformers Public

    Create PyTerrier compatible dense indices using any sentence_transformers model

    Python 5 3