[INFO] cloning repository https://github.com/Nu11ified/picochat [INFO] running `Command { std: "git" "-c" "credential.helper=" "-c" "credential.helper=/workspace/cargo-home/bin/git-credential-null" "clone" "--bare" "https://github.com/Nu11ified/picochat" "/workspace/cache/git-repos/https%3A%2F%2Fgithub.com%2FNu11ified%2Fpicochat", kill_on_drop: false }` [INFO] [stderr] Cloning into bare repository '/workspace/cache/git-repos/https%3A%2F%2Fgithub.com%2FNu11ified%2Fpicochat'... [INFO] running `Command { std: "git" "rev-parse" "HEAD", kill_on_drop: false }` [INFO] [stdout] b1612dd8ce896eaf12bfc50b3df34abde51cf36e [INFO] testing Nu11ified/picochat against beta-2026-04-21 for beta-1.96-1 [INFO] running `Command { std: "git" "clone" "/workspace/cache/git-repos/https%3A%2F%2Fgithub.com%2FNu11ified%2Fpicochat" "/workspace/builds/worker-2-tc2/source", kill_on_drop: false }` [INFO] [stderr] Cloning into '/workspace/builds/worker-2-tc2/source'... [INFO] [stderr] done. [INFO] started tweaking git repo https://github.com/Nu11ified/picochat [INFO] finished tweaking git repo https://github.com/Nu11ified/picochat [INFO] tweaked toml for git repo https://github.com/Nu11ified/picochat written to /workspace/builds/worker-2-tc2/source/Cargo.toml [INFO] validating manifest of git repo https://github.com/Nu11ified/picochat on toolchain beta-2026-04-21 [INFO] running `Command { std: CARGO_HOME="/workspace/cargo-home" RUSTUP_HOME="/workspace/rustup-home" "/workspace/cargo-home/bin/cargo" "+beta-2026-04-21" "metadata" "--manifest-path" "Cargo.toml" "--no-deps", kill_on_drop: false }` [INFO] crate git repo https://github.com/Nu11ified/picochat already has a lockfile, it will not be regenerated [INFO] running `Command { std: CARGO_HOME="/workspace/cargo-home" RUSTUP_HOME="/workspace/rustup-home" "/workspace/cargo-home/bin/cargo" "+beta-2026-04-21" "fetch" "--manifest-path" "Cargo.toml", kill_on_drop: false }` [INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-2-tc2/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-2-tc2/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+beta-2026-04-21" "metadata" "--no-deps" "--format-version=1", kill_on_drop: false }` [INFO] [stdout] 097cbd5766890814464c58e7dd6cb0d6f1309849d56dbdf5820a2f705bad77db [INFO] running `Command { std: "docker" "start" "-a" "097cbd5766890814464c58e7dd6cb0d6f1309849d56dbdf5820a2f705bad77db", kill_on_drop: false }` [INFO] running `Command { std: "docker" "inspect" "097cbd5766890814464c58e7dd6cb0d6f1309849d56dbdf5820a2f705bad77db", kill_on_drop: false }` [INFO] running `Command { std: "docker" "rm" "-f" "097cbd5766890814464c58e7dd6cb0d6f1309849d56dbdf5820a2f705bad77db", kill_on_drop: false }` [INFO] [stdout] 097cbd5766890814464c58e7dd6cb0d6f1309849d56dbdf5820a2f705bad77db [INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-2-tc2/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-2-tc2/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_INCREMENTAL=0" "-e" "RUST_BACKTRACE=full" "-e" "RUSTFLAGS=--cap-lints=warn" "-e" "RUSTDOCFLAGS=--cap-lints=warn" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+beta-2026-04-21" "build" "--frozen" "--message-format=json", kill_on_drop: false }` [INFO] [stdout] 571a683c750e1f9806086a2b9dc64d5dd2a8e2d7c7cd83690fec2885f53e1bc4 [INFO] running `Command { std: "docker" "start" "-a" "571a683c750e1f9806086a2b9dc64d5dd2a8e2d7c7cd83690fec2885f53e1bc4", kill_on_drop: false }` [INFO] [stderr] Compiling proc-macro2 v1.0.106 [INFO] [stderr] Compiling quote v1.0.44 [INFO] [stderr] Compiling unicode-ident v1.0.24 [INFO] [stderr] Compiling libc v0.2.182 [INFO] [stderr] Compiling cfg-if v1.0.4 [INFO] [stderr] Compiling libm v0.2.16 [INFO] [stderr] Compiling autocfg v1.5.0 [INFO] [stderr] Compiling zerocopy v0.8.40 [INFO] [stderr] Compiling getrandom v0.3.4 [INFO] [stderr] Compiling once_cell v1.21.3 [INFO] [stderr] Compiling version_check v0.9.5 [INFO] [stderr] Compiling crossbeam-utils v0.8.21 [INFO] [stderr] Compiling bytes v1.11.1 [INFO] [stderr] Compiling num-traits v0.2.19 [INFO] [stderr] Compiling reborrow v0.5.5 [INFO] [stderr] Compiling memchr v2.8.0 [INFO] [stderr] Compiling rayon-core v1.13.0 [INFO] [stderr] Compiling seq-macro v0.3.6 [INFO] [stderr] Compiling either v1.15.0 [INFO] [stderr] Compiling itoa v1.0.17 [INFO] [stderr] Compiling crossbeam-epoch v0.9.18 [INFO] [stderr] Compiling pin-project-lite v0.2.17 [INFO] [stderr] Compiling syn v2.0.117 [INFO] [stderr] Compiling bitflags v1.3.2 [INFO] [stderr] Compiling serde_core v1.0.228 [INFO] [stderr] Compiling pulp v0.21.5 [INFO] [stderr] Compiling crossbeam-deque v0.8.6 [INFO] [stderr] Compiling rand_core v0.9.5 [INFO] [stderr] Compiling bitflags v2.11.0 [INFO] [stderr] Compiling dyn-stack-macros v0.1.3 [INFO] [stderr] Compiling ahash v0.8.12 [INFO] [stderr] Compiling raw-cpuid v11.6.0 [INFO] [stderr] Compiling serde v1.0.228 [INFO] [stderr] Compiling zmij v1.0.21 [INFO] [stderr] Compiling rayon v1.11.0 [INFO] [stderr] Compiling raw-cpuid v10.7.0 [INFO] [stderr] Compiling serde_json v1.0.149 [INFO] [stderr] Compiling iana-time-zone v0.1.65 [INFO] [stderr] Compiling arrow-schema v53.4.1 [INFO] [stderr] Compiling num-integer v0.1.46 [INFO] [stderr] Compiling chrono v0.4.39 [INFO] [stderr] Compiling num-bigint v0.4.6 [INFO] [stderr] Compiling num-iter v0.1.45 [INFO] [stderr] Compiling tracing-core v0.1.36 [INFO] [stderr] Compiling equivalent v1.0.2 [INFO] [stderr] Compiling rustversion v1.0.22 [INFO] [stderr] Compiling hashbrown v0.15.5 [INFO] [stderr] Compiling log v0.4.29 [INFO] [stderr] Compiling hashbrown v0.16.1 [INFO] [stderr] Compiling winnow v0.7.14 [INFO] [stderr] Compiling toml_datetime v0.7.5+spec-1.1.0 [INFO] [stderr] Compiling num-rational v0.4.2 [INFO] [stderr] Compiling futures-core v0.3.32 [INFO] [stderr] Compiling lexical-util v1.0.7 [INFO] [stderr] Compiling futures-sink v0.3.32 [INFO] [stderr] Compiling thiserror v1.0.69 [INFO] [stderr] Compiling indexmap v2.13.0 [INFO] [stderr] Compiling aho-corasick v1.1.4 [INFO] [stderr] Compiling parking_lot_core v0.9.12 [INFO] [stderr] Compiling stable_deref_trait v1.2.1 [INFO] [stderr] Compiling synstructure v0.13.2 [INFO] [stderr] Compiling regex-syntax v0.8.10 [INFO] [stderr] Compiling crc32fast v1.5.0 [INFO] [stderr] Compiling byteorder v1.5.0 [INFO] [stderr] Compiling anyhow v1.0.102 [INFO] [stderr] Compiling lexical-write-integer v1.0.6 [INFO] [stderr] Compiling lexical-parse-integer v1.0.6 [INFO] [stderr] Compiling toml_parser v1.0.9+spec-1.1.0 [INFO] [stderr] Compiling futures-channel v0.3.32 [INFO] [stderr] Compiling smallvec v1.15.1 [INFO] [stderr] Compiling zip v1.1.4 [INFO] [stderr] Compiling semver v1.0.27 [INFO] [stderr] Compiling scopeguard v1.2.0 [INFO] [stderr] Compiling ryu v1.0.23 [INFO] [stderr] Compiling lock_api v0.4.14 [INFO] [stderr] Compiling rustc_version v0.4.1 [INFO] [stderr] Compiling toml_edit v0.23.10+spec-1.0.0 [INFO] [stderr] Compiling lexical-write-float v1.0.6 [INFO] [stderr] Compiling lexical-parse-float v1.0.6 [INFO] [stderr] Compiling memmap2 v0.9.10 [INFO] [stderr] Compiling getrandom v0.2.17 [INFO] [stderr] Compiling num_cpus v1.17.0 [INFO] [stderr] Compiling errno v0.3.14 [INFO] [stderr] Compiling libloading v0.8.9 [INFO] [stderr] Compiling unicode-segmentation v1.12.0 [INFO] [stderr] Compiling unicode-width v0.2.2 [INFO] [stderr] Compiling zerocopy-derive v0.8.40 [INFO] [stderr] Compiling bytemuck_derive v1.10.2 [INFO] [stderr] Compiling serde_derive v1.0.228 [INFO] [stderr] Compiling tracing-attributes v0.1.31 [INFO] [stderr] Compiling proc-macro-crate v3.4.0 [INFO] [stderr] Compiling zerofrom-derive v0.1.6 [INFO] [stderr] Compiling thiserror-impl v1.0.69 [INFO] [stderr] Compiling num_enum_derive v0.7.5 [INFO] [stderr] Compiling bytemuck v1.25.0 [INFO] [stderr] Compiling num-complex v0.4.6 [INFO] [stderr] Compiling dyn-stack v0.13.2 [INFO] [stderr] Compiling dyn-stack v0.10.0 [INFO] [stderr] Compiling tracing v0.1.44 [INFO] [stderr] Compiling zerofrom v0.1.6 [INFO] [stderr] Compiling regex-automata v0.4.14 [INFO] [stderr] Compiling yoke-derive v0.7.5 [INFO] [stderr] Compiling num v0.4.3 [INFO] [stderr] Compiling pulp v0.18.22 [INFO] [stderr] Compiling num_enum v0.7.5 [INFO] [stderr] Compiling displaydoc v0.2.5 [INFO] [stderr] Compiling comfy-table v7.2.2 [INFO] [stderr] Compiling futures-macro v0.3.32 [INFO] [stderr] Compiling yoke v0.7.5 [INFO] [stderr] Compiling tokio-macros v2.6.1 [INFO] [stderr] Compiling signal-hook-registry v1.4.8 [INFO] [stderr] Compiling lexical-core v1.0.6 [INFO] [stderr] Compiling rand_core v0.6.4 [INFO] [stderr] Compiling flatbuffers v24.12.23 [INFO] [stderr] Compiling parking_lot v0.12.5 [INFO] [stderr] Compiling atoi v2.0.0 [INFO] [stderr] Compiling mio v1.1.1 [INFO] [stderr] Compiling socket2 v0.6.2 [INFO] [stderr] Compiling http v1.4.0 [INFO] [stderr] Compiling safetensors v0.4.5 [INFO] [stderr] Compiling bit-vec v0.8.0 [INFO] [stderr] Compiling base64 v0.22.1 [INFO] [stderr] Compiling futures-io v0.3.32 [INFO] [stderr] Compiling slab v0.4.12 [INFO] [stderr] Compiling futures-task v0.3.32 [INFO] [stderr] Compiling bit-set v0.8.0 [INFO] [stderr] Compiling tokio v1.49.0 [INFO] [stderr] Compiling futures-util v0.3.32 [INFO] [stderr] Compiling snap v1.1.1 [INFO] [stderr] Compiling http-body v1.0.1 [INFO] [stderr] Compiling regex v1.12.3 [INFO] [stderr] Compiling fancy-regex v0.14.0 [INFO] [stderr] Compiling ordered-float v2.10.1 [INFO] [stderr] Compiling httparse v1.10.1 [INFO] [stderr] Compiling static_assertions v1.1.0 [INFO] [stderr] Compiling tower-service v0.3.3 [INFO] [stderr] Compiling integer-encoding v3.0.4 [INFO] [stderr] Compiling twox-hash v1.6.3 [INFO] [stderr] Compiling thrift v0.17.0 [INFO] [stderr] Compiling picochat-tokenizer v0.1.0 (/opt/rustwide/workdir/crates/picochat-tokenizer) [INFO] [stderr] Compiling percent-encoding v2.3.2 [INFO] [stderr] Compiling ppv-lite86 v0.2.21 [INFO] [stderr] Compiling tower-layer v0.3.3 [INFO] [stderr] Compiling httpdate v1.0.3 [INFO] [stderr] Compiling unicase v2.9.0 [INFO] [stderr] Compiling mime v0.3.17 [INFO] [stderr] Compiling rand_chacha v0.9.0 [INFO] [stderr] Compiling rand_chacha v0.3.1 [INFO] [stderr] Compiling mime_guess v2.0.5 [INFO] [stderr] Compiling rand v0.9.2 [INFO] [stderr] Compiling rand v0.8.5 [INFO] [stderr] Compiling http-body-util v0.1.3 [INFO] [stderr] Compiling picochat-tool v0.1.0 (/opt/rustwide/workdir/crates/picochat-tool) [INFO] [stderr] Compiling atomic-waker v1.1.2 [INFO] [stderr] Compiling utf8parse v0.2.2 [INFO] [stderr] Compiling rand_distr v0.5.1 [INFO] [stderr] Compiling sync_wrapper v1.0.2 [INFO] [stderr] Compiling anstyle-parse v0.2.7 [INFO] [stderr] Compiling form_urlencoded v1.2.2 [INFO] [stderr] Compiling async-trait v0.1.89 [INFO] [stderr] Compiling is_terminal_polyfill v1.70.2 [INFO] [stderr] Compiling anstyle-query v1.1.5 [INFO] [stderr] Compiling anstyle v1.0.13 [INFO] [stderr] Compiling colorchoice v1.0.4 [INFO] [stderr] Compiling serde_urlencoded v0.7.1 [INFO] [stderr] Compiling anstream v0.6.21 [INFO] [stderr] Compiling half v2.7.1 [INFO] [stderr] Compiling futures-executor v0.3.32 [INFO] [stderr] Compiling axum-macros v0.4.2 [INFO] [stderr] Compiling serde_path_to_error v0.1.20 [INFO] [stderr] Compiling arrow-buffer v53.4.1 [INFO] [stderr] Compiling gemm-common v0.18.2 [INFO] [stderr] Compiling gemm-common v0.17.1 [INFO] [stderr] Compiling axum-core v0.4.5 [INFO] [stderr] Compiling clap_lex v1.0.0 [INFO] [stderr] Compiling gemm-f32 v0.18.2 [INFO] [stderr] Compiling arrow-data v53.4.1 [INFO] [stderr] Compiling gemm-f32 v0.17.1 [INFO] [stderr] Compiling gemm-c64 v0.18.2 [INFO] [stderr] Compiling gemm-f64 v0.18.2 [INFO] [stderr] Compiling arrow-array v53.4.1 [INFO] [stderr] Compiling gemm-f16 v0.18.2 [INFO] [stderr] Compiling gemm-c32 v0.18.2 [INFO] [stderr] Compiling gemm-f16 v0.17.1 [INFO] [stderr] Compiling gemm-c32 v0.17.1 [INFO] [stderr] Compiling gemm-c64 v0.17.1 [INFO] [stderr] Compiling arrow-select v53.4.1 [INFO] [stderr] Compiling gemm v0.18.2 [INFO] [stderr] Compiling gemm-f64 v0.17.1 [INFO] [stderr] Compiling ug v0.1.0 [INFO] [stderr] Compiling gemm v0.17.1 [INFO] [stderr] Compiling arrow-cast v53.4.1 [INFO] [stderr] Compiling candle-core v0.8.4 [INFO] [stderr] Compiling arrow-string v53.4.1 [INFO] [stderr] Compiling arrow-ord v53.4.1 [INFO] [stderr] Compiling arrow-arith v53.4.1 [INFO] [stderr] Compiling arrow-row v53.4.1 [INFO] [stderr] Compiling hyper v1.8.1 [INFO] [stderr] Compiling arrow-ipc v53.4.1 [INFO] [stderr] Compiling arrow v53.4.1 [INFO] [stderr] Compiling hyper-util v0.1.20 [INFO] [stderr] Compiling tower v0.5.3 [INFO] [stderr] Compiling tokio-util v0.7.18 [INFO] [stderr] Compiling http-range-header v0.4.2 [INFO] [stderr] Compiling matchit v0.7.3 [INFO] [stderr] Compiling strsim v0.11.1 [INFO] [stderr] Compiling parquet v53.4.1 [INFO] [stderr] Compiling clap_derive v4.5.55 [INFO] [stderr] Compiling tower-http v0.5.2 [INFO] [stderr] Compiling axum v0.7.9 [INFO] [stderr] Compiling clap_builder v4.5.60 [INFO] [stderr] Compiling candle-nn v0.8.4 [INFO] [stderr] Compiling futures v0.3.32 [INFO] [stderr] Compiling picochat-core v0.1.0 (/opt/rustwide/workdir/crates/picochat-core) [INFO] [stderr] Compiling picochat-engine v0.1.0 (/opt/rustwide/workdir/crates/picochat-engine) [INFO] [stderr] Compiling picochat-optim v0.1.0 (/opt/rustwide/workdir/crates/picochat-optim) [INFO] [stderr] Compiling clap v4.5.60 [INFO] [stderr] Compiling picochat-data v0.1.0 (/opt/rustwide/workdir/crates/picochat-data) [INFO] [stderr] Compiling picochat-eval v0.1.0 (/opt/rustwide/workdir/crates/picochat-eval) [INFO] [stderr] Compiling picochat-train v0.1.0 (/opt/rustwide/workdir/crates/picochat-train) [INFO] [stderr] Compiling picochat-serve v0.1.0 (/opt/rustwide/workdir/crates/picochat-serve) [INFO] [stderr] Compiling picochat-cli v0.1.0 (/opt/rustwide/workdir/crates/picochat-cli) [INFO] [stderr] Finished `dev` profile [unoptimized + debuginfo] target(s) in 4m 42s [INFO] running `Command { std: "docker" "inspect" "571a683c750e1f9806086a2b9dc64d5dd2a8e2d7c7cd83690fec2885f53e1bc4", kill_on_drop: false }` [INFO] running `Command { std: "docker" "rm" "-f" "571a683c750e1f9806086a2b9dc64d5dd2a8e2d7c7cd83690fec2885f53e1bc4", kill_on_drop: false }` [INFO] [stdout] 571a683c750e1f9806086a2b9dc64d5dd2a8e2d7c7cd83690fec2885f53e1bc4 [INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-2-tc2/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-2-tc2/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_INCREMENTAL=0" "-e" "RUST_BACKTRACE=full" "-e" "RUSTFLAGS=--cap-lints=warn" "-e" "RUSTDOCFLAGS=--cap-lints=warn" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+beta-2026-04-21" "test" "--frozen" "--no-run" "--message-format=json", kill_on_drop: false }` [INFO] [stdout] 0c719fa092c505eadc7e80b9f037431b6ed4686a701faee122798dededb64c3a [INFO] running `Command { std: "docker" "start" "-a" "0c719fa092c505eadc7e80b9f037431b6ed4686a701faee122798dededb64c3a", kill_on_drop: false }` [INFO] [stderr] Compiling picochat-serve v0.1.0 (/opt/rustwide/workdir/crates/picochat-serve) [INFO] [stderr] Compiling picochat-train v0.1.0 (/opt/rustwide/workdir/crates/picochat-train) [INFO] [stderr] Compiling picochat-cli v0.1.0 (/opt/rustwide/workdir/crates/picochat-cli) [INFO] [stderr] Compiling picochat-eval v0.1.0 (/opt/rustwide/workdir/crates/picochat-eval) [INFO] [stderr] Compiling picochat-data v0.1.0 (/opt/rustwide/workdir/crates/picochat-data) [INFO] [stderr] Compiling picochat-optim v0.1.0 (/opt/rustwide/workdir/crates/picochat-optim) [INFO] [stderr] Compiling picochat-engine v0.1.0 (/opt/rustwide/workdir/crates/picochat-engine) [INFO] [stderr] Compiling picochat-tool v0.1.0 (/opt/rustwide/workdir/crates/picochat-tool) [INFO] [stderr] Compiling picochat-core v0.1.0 (/opt/rustwide/workdir/crates/picochat-core) [INFO] [stderr] Compiling picochat-tokenizer v0.1.0 (/opt/rustwide/workdir/crates/picochat-tokenizer) [INFO] [stderr] Finished `test` profile [unoptimized + debuginfo] target(s) in 53.29s [INFO] running `Command { std: "docker" "inspect" "0c719fa092c505eadc7e80b9f037431b6ed4686a701faee122798dededb64c3a", kill_on_drop: false }` [INFO] running `Command { std: "docker" "rm" "-f" "0c719fa092c505eadc7e80b9f037431b6ed4686a701faee122798dededb64c3a", kill_on_drop: false }` [INFO] [stdout] 0c719fa092c505eadc7e80b9f037431b6ed4686a701faee122798dededb64c3a [INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-2-tc2/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-2-tc2/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_INCREMENTAL=0" "-e" "RUST_BACKTRACE=full" "-e" "RUSTFLAGS=--cap-lints=warn" "-e" "RUSTDOCFLAGS=--cap-lints=warn" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+beta-2026-04-21" "test" "--frozen", kill_on_drop: false }` [INFO] [stdout] f8f01aad9de62757106dbf99489b026fffd24f44c38194361d14d50602647221 [INFO] running `Command { std: "docker" "start" "-a" "f8f01aad9de62757106dbf99489b026fffd24f44c38194361d14d50602647221", kill_on_drop: false }` [INFO] [stderr] Finished `test` profile [unoptimized + debuginfo] target(s) in 0.59s [INFO] [stderr] Running unittests src/main.rs (/opt/rustwide/target/debug/deps/picochat-3d213e8322aaa7f2) [INFO] [stdout] [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_core-f5cd44991902f08b) [INFO] [stdout] running 0 tests [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running tests/attention_test.rs (/opt/rustwide/target/debug/deps/attention_test-472e04a7cffe3d10) [INFO] [stdout] [INFO] [stdout] running 0 tests [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 2 tests [INFO] [stdout] test test_attention_output_shape ... ok [INFO] [stdout] test test_attention_causal_masking ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 1.02s [INFO] [stdout] [INFO] [stderr] Running tests/config_test.rs (/opt/rustwide/target/debug/deps/config_test-9e7448f4b02867e5) [INFO] [stdout] [INFO] [stdout] running 6 tests [INFO] [stdout] test test_depth_12_config ... ok [INFO] [stdout] test test_depth_26_gpt2_config ... ok [INFO] [stdout] test test_head_dim_consistent ... ok [INFO] [stdout] test test_padded_vocab_size ... ok [INFO] [stdout] test test_window_sizes ... ok [INFO] [stdout] test test_depth_4_small_config ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stderr] Running tests/init_test.rs (/opt/rustwide/target/debug/deps/init_test-6e5254d30e302af3) [INFO] [stdout] [INFO] [stdout] running 7 tests [INFO] [stdout] test test_c_proj_init_to_zero has been running for over 60 seconds [INFO] [stdout] test test_lm_head_init_narrow_normal has been running for over 60 seconds [INFO] [stdout] test test_resid_lambdas_init_to_one has been running for over 60 seconds [INFO] [stdout] test test_uniform_weights_in_range has been running for over 60 seconds [INFO] [stdout] test test_ve_gate_init_to_zero has been running for over 60 seconds [INFO] [stdout] test test_wte_init_normal has been running for over 60 seconds [INFO] [stdout] test test_x0_lambdas_init_to_point_one has been running for over 60 seconds [INFO] [stdout] test test_uniform_weights_in_range ... ok [INFO] [stdout] test test_x0_lambdas_init_to_point_one ... ok [INFO] [stdout] test test_ve_gate_init_to_zero ... ok [INFO] [stdout] test test_c_proj_init_to_zero ... ok [INFO] [stdout] test test_resid_lambdas_init_to_one ... ok [INFO] [stdout] test test_wte_init_normal ... ok [INFO] [stdout] test test_lm_head_init_narrow_normal ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 7 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 262.68s [INFO] [stdout] [INFO] [stderr] Running tests/kv_cache_test.rs (/opt/rustwide/target/debug/deps/kv_cache_test-30d68a9844efcd02) [INFO] [stdout] [INFO] [stdout] running 6 tests [INFO] [stdout] test test_kv_cache_new ... ok [INFO] [stdout] test test_kv_cache_reset ... ok [INFO] [stdout] test test_layer_cache_update ... ok [INFO] [stdout] test test_training_forward_unchanged ... ok [INFO] [stdout] test test_forward_with_cache_prefill ... ok [INFO] [stdout] test test_forward_with_cache_decode ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 26.18s [INFO] [stdout] [INFO] [stderr] Running tests/mlp_test.rs (/opt/rustwide/target/debug/deps/mlp_test-0416dbba690bc241) [INFO] [stdout] [INFO] [stdout] running 2 tests [INFO] [stdout] test test_mlp_relu_squared_activation ... ok [INFO] [stdout] test test_mlp_output_shape ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.06s [INFO] [stdout] [INFO] [stderr] Running tests/model_test.rs (/opt/rustwide/target/debug/deps/model_test-ee873a06b024f766) [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_gpt_depth4_small has been running for over 60 seconds [INFO] [stdout] test test_gpt_forward_logits_shape has been running for over 60 seconds [INFO] [stdout] test test_gpt_forward_with_targets has been running for over 60 seconds [INFO] [stdout] test test_gpt_depth4_small ... ok [INFO] [stdout] test test_gpt_forward_with_targets ... ok [INFO] [stdout] test test_gpt_forward_logits_shape ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 64.57s [INFO] [stdout] [INFO] [stderr] Running tests/norm_test.rs (/opt/rustwide/target/debug/deps/norm_test-89bf14c16f9c2230) [INFO] [stdout] [INFO] [stdout] running 2 tests [INFO] [stdout] test test_rms_norm_shape_preserved ... ok [INFO] [stderr] Running tests/rotary_test.rs (/opt/rustwide/target/debug/deps/rotary_test-484680f435d39e31) [INFO] [stdout] test test_rms_norm_unit_rms ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_apply_rotary_emb_shape ... ok [INFO] [stdout] test test_rotary_precompute_shapes ... ok [INFO] [stdout] test test_rotary_offset_for_kv_cache ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.10s [INFO] [stdout] [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_data-426294693b3d3ef7) [INFO] [stdout] [INFO] [stdout] running 0 tests [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running tests/arc_test.rs (/opt/rustwide/target/debug/deps/arc_test-5b80439ed6d44fc5) [INFO] [stdout] [INFO] [stdout] running 4 tests [INFO] [stdout] test test_load_arc_from_string ... ok [INFO] [stdout] test test_arc_question_parse ... ok [INFO] [stdout] test test_arc_question_answer_index ... ok [INFO] [stdout] test test_format_arc_prompt ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stderr] Running tests/dataloader_test.rs (/opt/rustwide/target/debug/deps/dataloader_test-9d0af0cc04d6112d) [INFO] [stdout] [INFO] [stdout] running 10 tests [INFO] [stdout] test test_packing_long_document_splits ... ok [INFO] [stdout] test test_packing_single_document ... ok [INFO] [stdout] test test_packing_target_shift ... ok [INFO] [stdout] test test_packing_flush_pads ... ok [INFO] [stdout] test test_packing_bos_prepended ... ok [INFO] [stdout] test test_dataloader_batch_shape ... ok [INFO] [stdout] test test_dataloader_target_is_shifted_input ... ok [INFO] [stdout] test test_dataset_empty ... ok [INFO] [stdout] test test_packing_batch_returns_none_when_insufficient ... ok [INFO] [stdout] test test_dataset_len ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 10 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.02s [INFO] [stdout] [INFO] [stderr] Running tests/mixture_test.rs (/opt/rustwide/target/debug/deps/mixture_test-0c420fd315feacad) [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_mixture_epoch_cycling ... ok [INFO] [stdout] test test_mixture_single_dataset ... ok [INFO] [stdout] test test_mixture_weighted_sampling ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.02s [INFO] [stdout] [INFO] [stderr] Running tests/parquet_test.rs (/opt/rustwide/target/debug/deps/parquet_test-0718d44bf833f08a) [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_read_parquet_texts ... ok [INFO] [stdout] test test_read_all_text ... ok [INFO] [stdout] test test_missing_column_error ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.03s [INFO] [stdout] [INFO] [stderr] Running tests/sft_test.rs (/opt/rustwide/target/debug/deps/sft_test-f9a1f60cb6f6024a) [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_mask_alignment_multi_turn ... ok [INFO] [stdout] test test_chat_message_parse ... ok [INFO] [stdout] test test_mask_alignment_single_turn ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stderr] Running tests/tool_data_test.rs (/opt/rustwide/target/debug/deps/tool_data_test-835e103eb1a66ac9) [INFO] [stdout] [INFO] [stdout] running 4 tests [INFO] [stdout] test test_tool_scenario_no_tool ... ok [INFO] [stdout] test test_tool_scenario_parse ... ok [INFO] [stdout] test test_load_tool_scenarios_from_string ... ok [INFO] [stdout] test test_format_tool_prompt ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_engine-a049176e944ccf22) [INFO] [stdout] [INFO] [stderr] Running tests/generate_logprobs_test.rs (/opt/rustwide/target/debug/deps/generate_logprobs_test-f665af8165d2b9c6) [INFO] [stdout] running 0 tests [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_logprobs_with_stop_token ... ok [INFO] [stdout] test test_logprobs_greedy_deterministic ... ok [INFO] [stdout] test test_logprobs_returns_ids_and_probs ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 30.58s [INFO] [stdout] [INFO] [stderr] Running tests/generate_test.rs (/opt/rustwide/target/debug/deps/generate_test-ce75d6d3f96f56d4) [INFO] [stdout] [INFO] [stdout] running 4 tests [INFO] [stdout] test test_generate_stops_at_max_tokens ... ok [INFO] [stdout] test test_generate_stops_at_stop_token ... ok [INFO] [stdout] test test_generate_produces_tokens ... ok [INFO] [stdout] test test_greedy_is_deterministic ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 40.48s [INFO] [stdout] [INFO] [stderr] Running tests/quantize_test.rs (/opt/rustwide/target/debug/deps/quantize_test-eea3850716473bae) [INFO] [stdout] [INFO] [stdout] running 4 tests [INFO] [stdout] test test_quantize_zeros ... ok [INFO] [stdout] test test_quantize_large_values ... ok [INFO] [stdout] test test_quantize_roundtrip ... ok [INFO] [stderr] Running tests/reasoning_test.rs (/opt/rustwide/target/debug/deps/reasoning_test-4dd15da04f5b138d) [INFO] [stdout] test test_scales_shape ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_output_segment_variants ... ok [INFO] [stdout] test test_output_segment_equality ... ok [INFO] [stdout] test test_segment_text_extraction ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stderr] Running tests/sampling_test.rs (/opt/rustwide/target/debug/deps/sampling_test-916150a20183a8a4) [INFO] [stdout] [INFO] [stdout] running 5 tests [INFO] [stdout] test test_default_params ... ok [INFO] [stdout] test test_greedy_with_zero_temperature ... ok [INFO] [stdout] test test_greedy_returns_argmax ... ok [INFO] [stdout] test test_top_k_limits_candidates ... ok [INFO] [stdout] test test_sample_respects_distribution ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_eval-18aa150d98e3978a) [INFO] [stdout] [INFO] [stdout] running 0 tests [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running tests/arc_eval_test.rs (/opt/rustwide/target/debug/deps/arc_eval_test-061c66448cdcfc75) [INFO] [stdout] [INFO] [stdout] running 1 test [INFO] [stdout] test test_arc_result_accuracy ... ok [INFO] [stderr] Running tests/bpb_test.rs (/opt/rustwide/target/debug/deps/bpb_test-b302816f08794b3c) [INFO] [stdout] [INFO] [stdout] test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 2 tests [INFO] [stdout] test test_bpb_result_fields ... ok [INFO] [stderr] Running tests/gsm8k_test.rs (/opt/rustwide/target/debug/deps/gsm8k_test-bbfcc9cc95fe6cf0) [INFO] [stdout] test test_bpb_formula ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 6 tests [INFO] [stdout] test test_format_gsm_prompt ... ok [INFO] [stdout] test test_extract_answer_basic ... ok [INFO] [stdout] test test_extract_answer_none ... ok [INFO] [stdout] test test_extract_answer_negative ... ok [INFO] [stdout] test test_extract_answer_decimal ... ok [INFO] [stderr] Running tests/mmlu_test.rs (/opt/rustwide/target/debug/deps/mmlu_test-a8b47ee3c58e6b17) [INFO] [stdout] test test_extract_answer_with_comma ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_pick_answer_from_logprobs ... ok [INFO] [stdout] test test_format_mmlu_prompt ... ok [INFO] [stdout] test test_pick_answer_tie_favors_first ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stderr] Running tests/reasoning_eval_test.rs (/opt/rustwide/target/debug/deps/reasoning_eval_test-ffb8be9d49c02356) [INFO] [stdout] [INFO] [stdout] running 5 tests [INFO] [stdout] test test_reasoning_metrics_empty ... ok [INFO] [stdout] test test_multiple_think_blocks ... ok [INFO] [stdout] test test_reasoning_metrics_with_thinking ... ok [INFO] [stdout] test test_self_correction_patterns ... ok [INFO] [stdout] test test_reasoning_metrics_no_thinking ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_optim-0faec94e3bace918) [INFO] [stdout] [INFO] [stdout] running 0 tests [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running tests/adamw_test.rs (/opt/rustwide/target/debug/deps/adamw_test-6c64c907c2b91633) [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_adamw_weight_decay ... ok [INFO] [stdout] test test_adamw_single_step_changes_params ... ok [INFO] [stdout] test test_adamw_multiple_steps_converge ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.02s [INFO] [stdout] [INFO] [stderr] Running tests/combined_test.rs (/opt/rustwide/target/debug/deps/combined_test-c5bef36d58fa1acc) [INFO] [stdout] [INFO] [stdout] running 5 tests [INFO] [stdout] test test_classify_params_by_name ... ok [INFO] [stdout] test test_scaling_with_different_n_embd ... ok [INFO] [stdout] test test_from_varmap_classifies_correctly ... ok [INFO] [stderr] Running tests/muon_test.rs (/opt/rustwide/target/debug/deps/muon_test-a1d89a09fffdd685) [INFO] [stdout] test test_combined_step_updates_all_params ... ok [INFO] [stdout] test test_combined_with_lr_multiplier ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.04s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 4 tests [INFO] [stderr] Running tests/schedule_test.rs (/opt/rustwide/target/debug/deps/schedule_test-f78df9cdacf39bb7) [INFO] [stdout] test test_polar_express_tall_matrix ... ok [INFO] [stdout] test test_muon_momentum_accumulates ... ok [INFO] [stdout] test test_muon_single_step ... ok [INFO] [stdout] test test_polar_express_near_orthogonal ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.02s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 5 tests [INFO] [stdout] test test_warmdown_ends_at_zero ... ok [INFO] [stdout] test test_constant_phase ... ok [INFO] [stdout] test test_warmdown_is_cosine ... ok [INFO] [stdout] test test_warmup_reaches_base_lr ... ok [INFO] [stdout] test test_warmup_starts_at_zero ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.05s [INFO] [stdout] [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_serve-650f589f0ccb28fa) [INFO] [stderr] Running tests/serve_test.rs (/opt/rustwide/target/debug/deps/serve_test-2ed534881a1e6242) [INFO] [stdout] [INFO] [stdout] running 0 tests [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 2 tests [INFO] [stdout] test test_sse_payload_serialization ... ok [INFO] [stdout] test test_segment_to_type_mapping ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_tokenizer-d4a89971056e8668) [INFO] [stdout] [INFO] [stdout] running 0 tests [INFO] [stderr] Running tests/bpe_test.rs (/opt/rustwide/target/debug/deps/bpe_test-04f80a18995902fd) [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 6 tests [INFO] [stdout] test test_merge_vocab_concatenation ... ok [INFO] [stdout] test test_vocab_has_byte_tokens ... ok [INFO] [stdout] test test_train_small_vocab ... ok [INFO] [stdout] test test_train_merges_count ... ok [INFO] [stdout] test test_pattern_splits_contractions ... ok [INFO] [stdout] test test_gpt4_pattern_compiles ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.20s [INFO] [stdout] [INFO] [stderr] Running tests/encode_test.rs (/opt/rustwide/target/debug/deps/encode_test-c9bc9dc8fcd70ad1) [INFO] [stdout] [INFO] [stdout] running 8 tests [INFO] [stdout] test test_encode_special_tokens ... ok [INFO] [stdout] test test_unicode_roundtrip ... ok [INFO] [stdout] test test_encode_empty_string ... ok [INFO] [stdout] test test_encode_reduces_token_count ... ok [INFO] [stdout] test test_encode_single_byte ... ok [INFO] [stdout] test test_encode_decode_roundtrip ... ok [INFO] [stdout] test test_adjacent_special_tokens ... ok [INFO] [stdout] test test_save_load_roundtrip ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 8 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.48s [INFO] [stdout] [INFO] [stderr] Running tests/special_test.rs (/opt/rustwide/target/debug/deps/special_test-928e7a1944ae5ff1) [INFO] [stdout] [INFO] [stdout] running 6 tests [INFO] [stdout] test test_all_strings_unique ... ok [INFO] [stdout] test test_registry_non_special_id_returns_none ... ok [INFO] [stdout] test test_registry_roundtrip_id ... ok [INFO] [stdout] test test_registry_ids_at_end_of_vocab ... ok [INFO] [stdout] test test_special_token_roundtrip_str ... ok [INFO] [stdout] test test_special_token_count ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_tool-da16ecaeb209b21a) [INFO] [stdout] [INFO] [stdout] running 0 tests [INFO] [stderr] Running tests/ast_test.rs (/opt/rustwide/target/debug/deps/ast_test-bc34effe1b6ec198) [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 13 tests [INFO] [stdout] test test_parse_binary_arithmetic ... ok [INFO] [stdout] test test_parse_method_call ... ok [INFO] [stdout] test test_parse_parenthesized ... ok [INFO] [stdout] test test_tokenize_arithmetic ... ok [INFO] [stdout] test test_parse_power ... ok [INFO] [stdout] test test_tokenize_function_call ... ok [INFO] [stdout] test test_tokenize_method_call ... ok [INFO] [stdout] test test_tokenize_string ... ok [INFO] [stdout] test test_tokenize_negative_number ... ok [INFO] [stdout] test test_tokenize_comparison ... ok [INFO] [stdout] test test_tokenize_number ... ok [INFO] [stdout] test test_parse_function_call ... ok [INFO] [stdout] test test_parse_operator_precedence ... ok [INFO] [stdout] [INFO] [stderr] Running tests/evaluator_test.rs (/opt/rustwide/target/debug/deps/evaluator_test-ea284549f044a3b0) [INFO] [stdout] test result: ok. 13 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 13 tests [INFO] [stdout] test test_basic_arithmetic ... ok [INFO] [stdout] test test_math_functions ... ok [INFO] [stdout] test test_parse_error ... ok [INFO] [stdout] test test_operator_precedence ... ok [INFO] [stdout] test test_integer_display ... ok [INFO] [stdout] test test_string_count ... ok [INFO] [stdout] test test_power ... ok [INFO] [stdout] test test_string_len ... ok [INFO] [stdout] test test_comparisons ... ok [INFO] [stdout] test test_string_upper_lower ... ok [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_train-467c3affeb64ed77) [INFO] [stdout] test test_unary_minus ... ok [INFO] [stdout] test test_unknown_function ... ok [INFO] [stdout] test test_division_by_zero ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 13 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stdout] [INFO] [stderr] Running tests/checkpoint_test.rs (/opt/rustwide/target/debug/deps/checkpoint_test-673b5f16504405e0) [INFO] [stdout] running 0 tests [INFO] [stdout] [INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 2 tests [INFO] [stdout] test test_save_and_load_config ... ok [INFO] [stdout] test test_save_and_load_roundtrip has been running for over 60 seconds [INFO] [stdout] test test_save_and_load_roundtrip ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 67.18s [INFO] [stdout] [INFO] [stderr] Running tests/grpo_test.rs (/opt/rustwide/target/debug/deps/grpo_test-f881de489d61200a) [INFO] [stdout] [INFO] [stdout] running 5 tests [INFO] [stdout] test test_compute_clipped_objective ... ok [INFO] [stdout] test test_grpo_config_defaults ... ok [INFO] [stdout] test test_compute_kl_penalty ... ok [INFO] [stdout] test test_normalize_advantages_all_same ... ok [INFO] [stdout] test test_normalize_advantages ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running tests/metrics_test.rs (/opt/rustwide/target/debug/deps/metrics_test-c09626fe0f1c57bf) [INFO] [stdout] [INFO] [stdout] running 4 tests [INFO] [stdout] test test_bpb_basic ... ok [INFO] [stdout] test test_mfu ... ok [INFO] [stdout] test test_throughput ... ok [INFO] [stdout] test test_tracker_accumulation ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stderr] Running tests/pretrain_test.rs (/opt/rustwide/target/debug/deps/pretrain_test-055c00bae08b1d9d) [INFO] [stdout] [INFO] [stdout] running 2 tests [INFO] [stdout] test test_pretrain_config_defaults ... ok [INFO] [stdout] test test_pretrain_tokens_per_step ... ok [INFO] [stdout] [INFO] [stderr] Running tests/rewards_test.rs (/opt/rustwide/target/debug/deps/rewards_test-0250886bc498129b) [INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 20 tests [INFO] [stdout] test test_accuracy_reward_math_correct ... ok [INFO] [stdout] test test_accuracy_reward_mc_correct ... ok [INFO] [stdout] test test_composite_reward ... ok [INFO] [stdout] test test_extract_final_answer_math ... ok [INFO] [stdout] test test_extract_final_answer_mc ... ok [INFO] [stdout] test test_format_reward_malformed_think ... ok [INFO] [stdout] test test_format_reward_missing_think ... ok [INFO] [stdout] test test_accuracy_reward_math_wrong ... ok [INFO] [stdout] test test_format_reward_think_after_answer ... ok [INFO] [stdout] test test_length_penalty ... ok [INFO] [stdout] test test_strip_think_blocks ... ok [INFO] [stdout] test test_strip_think_blocks_multiple ... ok [INFO] [stdout] test test_format_reward_think_but_no_answer ... ok [INFO] [stdout] test test_format_reward_valid ... ok [INFO] [stdout] test test_strip_think_blocks_none ... ok [INFO] [stdout] test test_tool_use_reward_correct_and_useful ... ok [INFO] [stdout] test test_tool_use_reward_no_tool_when_needed ... ok [INFO] [stdout] test test_tool_use_reward_correct_syntax_but_wrong ... ok [INFO] [stdout] test test_tool_use_reward_no_tool_not_needed ... ok [INFO] [stdout] test test_extract_final_answer_mc_from_choices ... ok [INFO] [stderr] Running tests/sft_test.rs (/opt/rustwide/target/debug/deps/sft_test-8e11f7e142b243ea) [INFO] [stdout] [INFO] [stdout] test result: ok. 20 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stdout] [INFO] [stdout] running 3 tests [INFO] [stdout] test test_masked_cross_entropy_all_masked ... ok [INFO] [stdout] test test_masked_cross_entropy_basic ... ok [INFO] [stdout] test test_sft_config ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s [INFO] [stdout] [INFO] [stderr] Running tests/trainer_test.rs (/opt/rustwide/target/debug/deps/trainer_test-97390a1706934352) [INFO] [stdout] [INFO] [stdout] running 2 tests [INFO] [stdout] test test_loss_decreases_over_steps has been running for over 60 seconds [INFO] [stdout] test test_single_train_step has been running for over 60 seconds [INFO] [stdout] test test_single_train_step ... ok [ERROR] error running command: command timed out after 900 seconds [INFO] running `Command { std: "docker" "inspect" "f8f01aad9de62757106dbf99489b026fffd24f44c38194361d14d50602647221", kill_on_drop: false }` [INFO] running `Command { std: "docker" "rm" "-f" "f8f01aad9de62757106dbf99489b026fffd24f44c38194361d14d50602647221", kill_on_drop: false }` [INFO] [stdout] f8f01aad9de62757106dbf99489b026fffd24f44c38194361d14d50602647221