moj-analytical-services/splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
📋 What's Changed
- :dependabot: github-actions(deps): Bump github/codeql-action from 4.32.4 to 4.32.6 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2980
- :dependabot: uv(deps-dev): Bump mkdocs-material from 9.7.4 to 9.7.5 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2984
- Depandas - blocking analysis & tests batch 2 by @ADBond in https://github.com/moj-analytical-services/splink/pull/2985
- :dependabot: github-actions(deps): Bump astral-sh/setup-uv from 7.3.1 to 7.6.0 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2986
- Remove `TestHelper.convert frame()` by @ADBond in https://github.com/moj-analytical-services/splink/pull/2987
- :dependabot: uv(deps-dev): Bump sqlglot from 29.0.1 to 30.0.2 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2989
- Publish via Trusted Publishing by @ADBond in https://github.com/moj-analytical-services/splink/pull/2990
- Merge 4 -> 5 by @ADBond in https://github.com/moj-analytical-services/splink/pull/2996
- + 9 more
📋 What's Changed
- Splink 5 - Remove implicit cache by @RobinL in https://github.com/moj-analytical-services/splink/pull/2847
- Splink 5 - Explicit cache table mgt fns by @RobinL in https://github.com/moj-analytical-services/splink/pull/2848
- Splink 5 - Remove salting by @RobinL in https://github.com/moj-analytical-services/splink/pull/2849
- Splink 5 - Add chunking by @RobinL in https://github.com/moj-analytical-services/splink/pull/2850
- Splink5 - Bayes factors to match weights final by @RobinL in https://github.com/moj-analytical-services/splink/pull/2851
- Drop athena by @ADBond in https://github.com/moj-analytical-services/splink/pull/2858
- Minor fixes by @RobinL in https://github.com/moj-analytical-services/splink/pull/2859
- Update changelog by @RobinL in https://github.com/moj-analytical-services/splink/pull/2860
- + 43 more
📋 What's Changed
- :dependabot: uv(deps-dev): bump pymdown-extensions from 10.20.1 to 10.21 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2942
- :dependabot: github-actions(deps): bump actions/dependency-review-action from 4.8.2 to 4.8.3 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2947
- Bump locked dependencies by @ADBond in https://github.com/moj-analytical-services/splink/pull/2948
- :dependabot: github-actions(deps): bump github/codeql-action from 4.31.10 to 4.32.4 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2946
- :dependabot: github-actions(deps): bump lycheeverse/lychee-action from 2.7.0 to 2.8.0 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2943
- Remove select star at start of pipeline for aliasing splink4 by @RobinL in https://github.com/moj-analytical-services/splink/pull/2962
- Blocking select only blocking cols splink 4 by @RobinL in https://github.com/moj-analytical-services/splink/pull/2961
- Optimise link only exploding blocking rules by @RobinL in https://github.com/moj-analytical-services/splink/pull/2963
- + 3 more
📋 What's Changed
- Faster `two_dataset_link_only` joins when joining small table to large in duckdb by @RobinL in https://github.com/moj-analytical-services/splink/pull/2936
- 4_0_15 release by @RobinL in https://github.com/moj-analytical-services/splink/pull/2938
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.14...v4.0.15
📋 What's Changed
- Two dataset link only exploding blocking rule optimisation by @RobinL in https://github.com/moj-analytical-services/splink/pull/2931
- Filtered neighbours gets persisted by @RobinL in https://github.com/moj-analytical-services/splink/pull/2933
- 4.0.14 release and changelog by @RobinL in https://github.com/moj-analytical-services/splink/pull/2934
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.13...v4.0.14
📋 What's Changed
- Bump lockfile versions by @ADBond in https://github.com/moj-analytical-services/splink/pull/2871
- :octocat: GitHub Actions Updates by @jacobwoffenden in https://github.com/moj-analytical-services/splink/pull/2868
- add data city use case by @RobinL in https://github.com/moj-analytical-services/splink/pull/2891
- Make tests compatible with pandas 3 by @ADBond in https://github.com/moj-analytical-services/splink/pull/2895
- :dependabot: github-actions(deps): Bump github/codeql-action from 4.31.9 to 4.31.10 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2889
- :dependabot: uv(deps-dev): Bump sqlglot from 28.5.0 to 28.6.0 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2890
- :dependabot: github-actions(deps): Bump astral-sh/setup-uv from 7.1.6 to 7.2.0 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2884
- :dependabot: github-actions(deps): Bump actions/checkout from 6.0.1 to 6.0.2 by @dependabot[bot] in https://github.com/moj-analytical-services/splink/pull/2896
- + 12 more
✨ New Contributors
- @jacobwoffenden made their first contribution in https://github.com/moj-analytical-services/splink/pull/2868
- @Mostafa-Armandi made their first contribution in https://github.com/moj-analytical-services/splink/pull/2894
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.12...v4.0.13
📋 What's Changed
- Drop python 3 8 by @ADBond in https://github.com/moj-analytical-services/splink/pull/2819
- Test python 3.14 by @ADBond in https://github.com/moj-analytical-services/splink/pull/2820
- Fix docs code blocks in admonitions by @ADBond in https://github.com/moj-analytical-services/splink/pull/2824
- Fix 2821 by @aymonwuolanne in https://github.com/moj-analytical-services/splink/pull/2826
- fix extract major version, make it robust to dev variants by @RobinL in https://github.com/moj-analytical-services/splink/pull/2829
- Always include match key by @RobinL in https://github.com/moj-analytical-services/splink/pull/2831
- Remove unused methods by @RobinL in https://github.com/moj-analytical-services/splink/pull/2832
- [MAINT] Lockfile dependencies upgrade by @ADBond in https://github.com/moj-analytical-services/splink/pull/2838
- + 14 more
✨ Added
- Support for chunking to allow processing of very large datasets in blocking and prediction [#2850](https://github.com/moj-analytical-services/splink/pull/2850)
- New `table_management` functions to explicitly manage table caching [#2848](https://github.com/moj-analytical-services/splink/pull/2848)
📋 Changed
- Internal probabilistic calculations now use Match Weights (log-odds) instead of Bayes Factors to improve numerical stability [#2851](https://github.com/moj-analytical-services/splink/pull/2851)
🗑️ Deprecated
- `bayes_factor_column_prefix` setting is deprecated in favour of `match_weight_column_prefix` [#2851](https://github.com/moj-analytical-services/splink/pull/2851)
🗑️ Removed
- Dropped support for Amazon Athena [#2858](https://github.com/moj-analytical-services/splink/pull/2858)
- Removed implicit caching mechanism and the `use_cache` parameter from database execution methods [#2847](https://github.com/moj-analytical-services/splink/pull/2847)
- Removed `materialise_blocked_pairs` argument from `predict` (blocked pairs are now always materialised) [#2848](https://github.com/moj-analytical-services/splink/pull/2848)
- Removed salting mechanism as it is no longer required for parallelisation in DuckDB [#2849](https://github.com/moj-analytical-services/splink/pull/2849)
Pre-release to ensure #2844 works correctly
📋 What's Changed
- Upgrade lockfile by @ADBond in https://github.com/moj-analytical-services/splink/pull/2811
- Improve clustering performance by @aymonwuolanne in https://github.com/moj-analytical-services/splink/pull/2800
- Simplify docs build by @ADBond in https://github.com/moj-analytical-services/splink/pull/2813
- Add py.typed marker by @ADBond in https://github.com/moj-analytical-services/splink/pull/2815
- Waterfall cache by @RobinL in https://github.com/moj-analytical-services/splink/pull/2816
- Remove obsolete Spark 4 warning by @ADBond in https://github.com/moj-analytical-services/splink/pull/2818
- Release 4 0 11 by @RobinL in https://github.com/moj-analytical-services/splink/pull/2817
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.10...v4.0.11
📋 What's Changed
- change NOT IN statement to NOT EXISTS statement by @aymonwuolanne in https://github.com/moj-analytical-services/splink/pull/2766
- Poetry -> uv by @ADBond in https://github.com/moj-analytical-services/splink/pull/2775
- Uncap sqlglot by @ADBond in https://github.com/moj-analytical-services/splink/pull/2785
- Bump minimum sqlglot version by @ADBond in https://github.com/moj-analytical-services/splink/pull/2786
- add laos by @RobinL in https://github.com/moj-analytical-services/splink/pull/2787
- Remove duplicated mypy check by @ADBond in https://github.com/moj-analytical-services/splink/pull/2788
- Fix more docs links by @ADBond in https://github.com/moj-analytical-services/splink/pull/2765
- Improve debug mode by @RobinL in https://github.com/moj-analytical-services/splink/pull/2789
- + 15 more
📋 What's Changed
- 2710 comparison viewer performance by @aymonwuolanne in https://github.com/moj-analytical-services/splink/pull/2716
- Apply pseudopeople tutorial feedback by @tylerdy in https://github.com/moj-analytical-services/splink/pull/2688
- Tie breaking in cluster_using_single_best_links by @aymonwuolanne in https://github.com/moj-analytical-services/splink/pull/2691
- update cookbook by @RobinL in https://github.com/moj-analytical-services/splink/pull/2722
- add use case by @RobinL in https://github.com/moj-analytical-services/splink/pull/2719
- typo and grammar fix by @daidoji in https://github.com/moj-analytical-services/splink/pull/2725
- Miscellaneous type-hinting by @ADBond in https://github.com/moj-analytical-services/splink/pull/2729
- Fixed another typo by @daidoji in https://github.com/moj-analytical-services/splink/pull/2727
- + 13 more
✨ New Contributors
- @daidoji made their first contribution in https://github.com/moj-analytical-services/splink/pull/2725
- @leppekja made their first contribution in https://github.com/moj-analytical-services/splink/pull/2731
- @calebhadley1 made their first contribution in https://github.com/moj-analytical-services/splink/pull/2767
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.8...v4.0.9
📋 What's Changed
- Lockfile update by @ADBond in https://github.com/moj-analytical-services/splink/pull/2649
- add dbt by @RobinL in https://github.com/moj-analytical-services/splink/pull/2658
- Upgrade Vega to 5.31.0 by @hedsnz in https://github.com/moj-analytical-services/splink/pull/2627
- use unique_id from settings by @RobinL in https://github.com/moj-analytical-services/splink/pull/2659
- Bump lockfile versions by @ADBond in https://github.com/moj-analytical-services/splink/pull/2664
- add princeton paper by @RobinL in https://github.com/moj-analytical-services/splink/pull/2669
- Add link to pydata by @RobinL in https://github.com/moj-analytical-services/splink/pull/2675
- Add PyData Global talk to md by @RobinL in https://github.com/moj-analytical-services/splink/pull/2676
- + 15 more
✨ New Contributors
- @hedsnz made their first contribution in https://github.com/moj-analytical-services/splink/pull/2627
- @tylerdy made their first contribution in https://github.com/moj-analytical-services/splink/pull/2665
- @rhyswilliams2 made their first contribution in https://github.com/moj-analytical-services/splink/pull/2696
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.7...v4.0.8
📋 What's Changed
- Add speed tests to docs by @RobinL in https://github.com/moj-analytical-services/splink/pull/2538
- Llm prompt to docs by @RobinL in https://github.com/moj-analytical-services/splink/pull/2541
- Fix typos in docs by @RobinL in https://github.com/moj-analytical-services/splink/pull/2542
- improve llm prompt by @RobinL in https://github.com/moj-analytical-services/splink/pull/2543
- Link to custom GPT by @RobinL in https://github.com/moj-analytical-services/splink/pull/2544
- Test python 3.13 by @ADBond in https://github.com/moj-analytical-services/splink/pull/2521
- Fix reference to similarity_jar_location by @julijonas in https://github.com/moj-analytical-services/splink/pull/2547
- Deprecation warning for python 3.8 by @ADBond in https://github.com/moj-analytical-services/splink/pull/2520
- + 51 more
✨ New Contributors
- @julijonas made their first contribution in https://github.com/moj-analytical-services/splink/pull/2547
- @medwar99 made their first contribution in https://github.com/moj-analytical-services/splink/pull/2559
- @gidelpanta made their first contribution in https://github.com/moj-analytical-services/splink/pull/2590
- @BenNBEIS made their first contribution in https://github.com/moj-analytical-services/splink/pull/2593
- @amaiaita made their first contribution in https://github.com/moj-analytical-services/splink/pull/2620
- @b-d-e made their first contribution in https://github.com/moj-analytical-services/splink/pull/2623
- @mpalomares-he made their first contribution in https://github.com/moj-analytical-services/splink/pull/2631
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.6...v4.0.7
📋 What's Changed
- Explicit selection by @ADBond in https://github.com/moj-analytical-services/splink/pull/2484
- Fix clustering in debug mode by @ADBond in https://github.com/moj-analytical-services/splink/pull/2485
- Less caching in debug mode by @ADBond in https://github.com/moj-analytical-services/splink/pull/2488
- Update changelog by @RobinL in https://github.com/moj-analytical-services/splink/pull/2497
- remove unnecessary import by @lubrst in https://github.com/moj-analytical-services/splink/pull/2500
- Spark test session handling by @ADBond in https://github.com/moj-analytical-services/splink/pull/2504
- Fix count_comparisons_from_blocking_rule by @RobinL in https://github.com/moj-analytical-services/splink/pull/2503
- Streamline docs by @RobinL in https://github.com/moj-analytical-services/splink/pull/2505
- + 13 more
✨ New Contributors
- @lubrst made their first contribution in https://github.com/moj-analytical-services/splink/pull/2500
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.5...v4.0.6
📋 What's Changed
- add EMA use case by @RobinL in https://github.com/moj-analytical-services/splink/pull/2468
- Change name of second __splink__cluster_count_row_numbered query, prevent table name conflict by @browo097302 in https://github.com/moj-analytical-services/splink/pull/2447
- Add iteration number to `neighbours_filtered` table by @ADBond in https://github.com/moj-analytical-services/splink/pull/2470
- Fix docs examples by @ADBond in https://github.com/moj-analytical-services/splink/pull/2471
- Docs - correct heading and link text by @ADBond in https://github.com/moj-analytical-services/splink/pull/2472
- Simplify Altair import by @ADBond in https://github.com/moj-analytical-services/splink/pull/2479
- Specify version range for `pytest-cov` in CI by @ADBond in https://github.com/moj-analytical-services/splink/pull/2489
- Compare two records - allow dataframes to be registered by @RobinL in https://github.com/moj-analytical-services/splink/pull/2493
- + 2 more
📋 What's Changed
- Handle threshold_match_probablity 0 in predict() #2420 by @browo097302 in https://github.com/moj-analytical-services/splink/pull/2425
- Take converged clusters out of play by @RobinL in https://github.com/moj-analytical-services/splink/pull/2436
- Fix clustering in linky jobs with source dataset column on Postgres by @ADBond in https://github.com/moj-analytical-services/splink/pull/2444
- Cluster multiple thresholds v2 by @RobinL in https://github.com/moj-analytical-services/splink/pull/2437
- Used .blocking_rule_sql property match_weights_interactive_history_chart() by @browo097302 in https://github.com/moj-analytical-services/splink/pull/2446
- restore pretty print of SplinkDataFrame by @RobinL in https://github.com/moj-analytical-services/splink/pull/2450
- 2440 add docstring to customrule by @RobinL in https://github.com/moj-analytical-services/splink/pull/2452
- Cluster multiple add stats by @RobinL in https://github.com/moj-analytical-services/splink/pull/2453
- + 5 more
✨ New Contributors
- @browo097302 made their first contribution in https://github.com/moj-analytical-services/splink/pull/2425
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.3...v4.0.4
📋 What's Changed
- fix dead links by @RobinL in https://github.com/moj-analytical-services/splink/pull/2430
- Cluster without linker by @RobinL in https://github.com/moj-analytical-services/splink/pull/2412
- Better autocomplete for dataframes by @RobinL in https://github.com/moj-analytical-services/splink/pull/2434
- v4.0.3 release by @RobinL in https://github.com/moj-analytical-services/splink/pull/2435
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.2...v4.0.3
📋 What's Changed
- Fix performance issue with exploding blocking rules by @RobinL in https://github.com/moj-analytical-services/splink/pull/2385
- Add cookbook to examples by @RobinL in https://github.com/moj-analytical-services/splink/pull/2388
- fix docs by @RobinL in https://github.com/moj-analytical-services/splink/pull/2389
- Create llm prompt by @RobinL in https://github.com/moj-analytical-services/splink/pull/2366
- 2351 fix spark sampling by @aymonwuolanne in https://github.com/moj-analytical-services/splink/pull/2390
- Improve number formatting and descriptions on match weight charts by @RobinL in https://github.com/moj-analytical-services/splink/pull/2392
- add labelling tool by @RobinL in https://github.com/moj-analytical-services/splink/pull/2393
- Fix ColumnsReversedLevel by @RobinL in https://github.com/moj-analytical-services/splink/pull/2395
- + 15 more
📋 What's Changed
- Bias blog by @ericakane-moj in https://github.com/moj-analytical-services/splink/pull/2279
- Fix bug in Postgres example by @fhightower in https://github.com/moj-analytical-services/splink/pull/2352
- Added new use case to index.md by @AnthonyTacquet in https://github.com/moj-analytical-services/splink/pull/2363
- Fixing issue with reaonly filesystems by @RossHammer in https://github.com/moj-analytical-services/splink/pull/2357
- Update changelog by @ADBond in https://github.com/moj-analytical-services/splink/pull/2370
- avoid attempting to cast `Infinity` to double for spark backend by @bkitej-rw in https://github.com/moj-analytical-services/splink/pull/2372
- Fix Spark 'InfinityD' bug by @ADBond in https://github.com/moj-analytical-services/splink/pull/2374
- Support duckdbpyrelation as input type by @RobinL in https://github.com/moj-analytical-services/splink/pull/2375
- + 7 more
✨ New Contributors
- @probjects made their first contribution in https://github.com/moj-analytical-services/splink/pull/2172
- @DavidFrenchSG made their first contribution in https://github.com/moj-analytical-services/splink/pull/2204
- @astimoore made their first contribution in https://github.com/moj-analytical-services/splink/pull/2229
- @dkaufman-rc made their first contribution in https://github.com/moj-analytical-services/splink/pull/2240
- @ericakane-moj made their first contribution in https://github.com/moj-analytical-services/splink/pull/2277
- @bnm3k made their first contribution in https://github.com/moj-analytical-services/splink/pull/2342
- @fhightower made their first contribution in https://github.com/moj-analytical-services/splink/pull/2352
- @AnthonyTacquet made their first contribution in https://github.com/moj-analytical-services/splink/pull/2363
- + 3 more
See https://moj-analytical-services.github.io/splink/blog/2024/07/24/splink-400-released.html for release announcement
📋 What's Changed
- Comparison that has tf adjustments = True properly accounts for column expressions by @RobinL in https://github.com/moj-analytical-services/splink/pull/2267
- Adjust package top level imports by @ADBond in https://github.com/moj-analytical-services/splink/pull/2269
- Evaluation docstrings by @RobinL in https://github.com/moj-analytical-services/splink/pull/2271
- Remove broken EM training options by @ADBond in https://github.com/moj-analytical-services/splink/pull/2272
- Restore lat-long SQL test by @ADBond in https://github.com/moj-analytical-services/splink/pull/2273
- Consistent `db_api` argument name by @ADBond in https://github.com/moj-analytical-services/splink/pull/2278
- Turn off previously configured options by @ADBond in https://github.com/moj-analytical-services/splink/pull/2276
- Remove jan 1st option from date of birth comparison by @RobinL in https://github.com/moj-analytical-services/splink/pull/2281
- + 5 more
📋 What's Changed
- Docs links by @RobinL in https://github.com/moj-analytical-services/splink/pull/2237
- Cherrypick various patches to master by @RobinL in https://github.com/moj-analytical-services/splink/pull/2241
- Update docstrings splink4 by @RobinL in https://github.com/moj-analytical-services/splink/pull/2246
- as spark dataframe in docs by @RobinL in https://github.com/moj-analytical-services/splink/pull/2247
- More docstrings by @RobinL in https://github.com/moj-analytical-services/splink/pull/2248
- Docstrings 3 by @RobinL in https://github.com/moj-analytical-services/splink/pull/2250
- Restore spark test mark by @ADBond in https://github.com/moj-analytical-services/splink/pull/2253
- add note about excludedocs by @RobinL in https://github.com/moj-analytical-services/splink/pull/2256
- + 9 more
📋 What's Changed
- Update docs for Splink4 by @RobinL in https://github.com/moj-analytical-services/splink/pull/2203
- Update comparison template library by @RobinL in https://github.com/moj-analytical-services/splink/pull/2214
- Further splink4 docs work by @RobinL in https://github.com/moj-analytical-services/splink/pull/2215
- Move comparison helpers by @RobinL in https://github.com/moj-analytical-services/splink/pull/2216
- Restore dev guides by @RobinL in https://github.com/moj-analytical-services/splink/pull/2217
- add back tags by @RobinL in https://github.com/moj-analytical-services/splink/pull/2218
- Splink4 docs: fix more links by @RobinL in https://github.com/moj-analytical-services/splink/pull/2225
- Athena linker splink4 migration by @RobinL in https://github.com/moj-analytical-services/splink/pull/2226
- + 5 more
📋 What's Changed
- Document first-time developer setup, add conda option by @zmbc in https://github.com/moj-analytical-services/splink/pull/2083
- fix links by @RobinL in https://github.com/moj-analytical-services/splink/pull/2097
- Add dirty reload for much faster updates by @RobinL in https://github.com/moj-analytical-services/splink/pull/2096
- Add documentation for spellchecker and spellcheck docs by @zslade in https://github.com/moj-analytical-services/splink/pull/2025
- Add graph definition to docs by @zslade in https://github.com/moj-analytical-services/splink/pull/1979
- Minor fixes to spellchecker by @zslade in https://github.com/moj-analytical-services/splink/pull/2113
- Changing args as kwargs by @jlb52 in https://github.com/moj-analytical-services/splink/pull/2116
- Update threshold_selection_tool.json by @aalexandersson in https://github.com/moj-analytical-services/splink/pull/2120
- + 22 more
✨ New Contributors
- @zmbc made their first contribution in https://github.com/moj-analytical-services/splink/pull/2083
- @jlb52 made their first contribution in https://github.com/moj-analytical-services/splink/pull/2116
- @aalexandersson made their first contribution in https://github.com/moj-analytical-services/splink/pull/2120
- @probjects made their first contribution in https://github.com/moj-analytical-services/splink/pull/2172
- @DavidFrenchSG made their first contribution in https://github.com/moj-analytical-services/splink/pull/2204
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v3.9.14...v3.9.15
📋 What's Changed
- Group linker functions thematically by @RobinL in https://github.com/moj-analytical-services/splink/pull/2192
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.0.dev5...v4.0.0.dev6
📋 What's Changed
- Further tidying of blocking analysis - better typing by @RobinL in https://github.com/moj-analytical-services/splink/pull/2186
- Misc tidying by @ADBond in https://github.com/moj-analytical-services/splink/pull/2182
- Consolidate accuracy 2 by @RobinL in https://github.com/moj-analytical-services/splink/pull/2187
- Move all backend functions to internals/ directory by @RobinL in https://github.com/moj-analytical-services/splink/pull/2189
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.0.dev4...v4.0.0.dev5
📋 What's Changed
- Simple extension to term frequency adjustments for inexact matches by @samkodes in https://github.com/moj-analytical-services/splink/pull/2020
- Update bug report template by @ADBond in https://github.com/moj-analytical-services/splink/pull/2073
- update colab links by @RobinL in https://github.com/moj-analytical-services/splink/pull/2080
- Fix mkdocs rendering symbols in notebook code by @ADBond in https://github.com/moj-analytical-services/splink/pull/2033
- Enqueue and compute methods by @RobinL in https://github.com/moj-analytical-services/splink/pull/2086
- rm deprecated action and bash scripts by @ThomasHepworth in https://github.com/moj-analytical-services/splink/pull/2094
- Fix sqlglot>=23.0.0 issue by @RobinL in https://github.com/moj-analytical-services/splink/pull/2079
- 3.9.14 release by @RobinL in https://github.com/moj-analytical-services/splink/pull/2095
- + 46 more
✨ New Contributors
- @samkodes made their first contribution in https://github.com/moj-analytical-services/splink/pull/2020
- @jlb52 made their first contribution in https://github.com/moj-analytical-services/splink/pull/2116
- @aalexandersson made their first contribution in https://github.com/moj-analytical-services/splink/pull/2120
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v4.0.0.dev3...v4.0.0.dev4
📋 What's Changed
- Update u probability formula and example in fellegi_sunter.md by @jacuna88 in https://github.com/moj-analytical-services/splink/pull/2036
- Splink 3: Increment minimum python version from 3.7 to 3.8 by @RobinL in https://github.com/moj-analytical-services/splink/pull/2031
- Make graph metrics public by @zslade in https://github.com/moj-analytical-services/splink/pull/2027
- Add PUDL to list of use cases by @zaneselvans in https://github.com/moj-analytical-services/splink/pull/2044
- Threshold selection tool by @samnlindsay in https://github.com/moj-analytical-services/splink/pull/2003
- Simple extension to term frequency adjustments for inexact matches by @samkodes in https://github.com/moj-analytical-services/splink/pull/2020
- Update bug report template by @ADBond in https://github.com/moj-analytical-services/splink/pull/2073
- Fix mkdocs rendering symbols in notebook code by @ADBond in https://github.com/moj-analytical-services/splink/pull/2033
- + 3 more
✨ New Contributors
- @jacuna88 made their first contribution in https://github.com/moj-analytical-services/splink/pull/2036
- @zaneselvans made their first contribution in https://github.com/moj-analytical-services/splink/pull/2044
- @samkodes made their first contribution in https://github.com/moj-analytical-services/splink/pull/2020
- Full Changelog: https://github.com/moj-analytical-services/splink/compare/v3.9.13...v3.9.14
