This kind of test is also more business-oriented, because they focus on testing external, visible behaviors (so it’s kind of “BDD in code”). You don’t have to think about internal implementation details anymore. In most cases, you don’t even need mocks, it’s enough to use hand-written Test Doubles/Fakes/Stubs (e.g. an in-memory list that simulates a database, fake time provider, etc) (tests with Fakes are much cleaner).
Task: Implement Large Lempel-Ziv. As a benchmark, compress all of Project Gutenberg; evaluate the resulting model numerically for autoregressive tokens/second and its ability to compress the Gutenberg corpus (perplexity), and qualitatively for its ability to write prose, technical documentation, code, poems, translations, one-shot prompts, etc. Provide a Nix flake which can be used to reproduce all results.
,更多细节参见PDF资料
报道称,这款设备基于现有品牌手机改造而成,预计将在年底或明年初推向民用市场,标志着量子保密通信技术从国家级重要基础设施向普通消费者生活的延伸。
Additional reporting by Paul Glynn, Tara Mewawalla and Annabel Rackham