We have developed a robust merging algorithm for error-prone time-dependent string data. This repository provides out-of-the-box functionality for merging time-series string data. Our goal is for this to be used in string preprocessing and merging stages in real-time voice data processing pipelines, such as ASR (Automatic Speech Recognition).
use string_combiner::StringCombiner;
fn main() {
let inputs = vec![
"Hello World".chars(),
"Hello worl d!".chars(),
"내 어린시절 우연히?".chars(),
"시찰 우연히 들었던 ".chars(),
"우연히 들었던 믿지 못할 한 마디".chars(),
"Hello bold".chars(),
];
let combiner = StringCombiner {
threshold_deletion_x: 3,
..Default::default()
};
let combined = combiner
.concat_strings(inputs)
.expect("Failed to concat texts");
let expected = "내 어린시절 우연히 들었던 믿지 못할 한 마디";
assert_eq!(expected, combined);
}
Please check our LICENSE file.