[INFO] fetching crate evals 0.3.0... [INFO] testing evals-0.3.0 against master#562dee4820c458d823175268e41601d4c060588a for pr-154210-1 [INFO] extracting crate evals 0.3.0 into /workspace/builds/worker-4-tc1/source [INFO] started tweaking crates.io crate evals 0.3.0 [INFO] removed 0 missing examples [INFO] finished tweaking crates.io crate evals 0.3.0 [INFO] tweaked toml for crates.io crate evals 0.3.0 written to /workspace/builds/worker-4-tc1/source/Cargo.toml [INFO] validating manifest of crates.io crate evals 0.3.0 on toolchain 562dee4820c458d823175268e41601d4c060588a [INFO] running `Command { std: CARGO_HOME="/workspace/cargo-home" RUSTUP_HOME="/workspace/rustup-home" "/workspace/cargo-home/bin/cargo" "+562dee4820c458d823175268e41601d4c060588a" "metadata" "--manifest-path" "Cargo.toml" "--no-deps", kill_on_drop: false }` [INFO] crate crates.io crate evals 0.3.0 already has a lockfile, it will not be regenerated [INFO] running `Command { std: CARGO_HOME="/workspace/cargo-home" RUSTUP_HOME="/workspace/rustup-home" "/workspace/cargo-home/bin/cargo" "+562dee4820c458d823175268e41601d4c060588a" "fetch" "--manifest-path" "Cargo.toml", kill_on_drop: false }` [INFO] [stderr] Updating crates.io index [INFO] [stderr] Downloading crates ... [INFO] [stderr] Downloaded config v0.15.22 [INFO] [stderr] Downloaded swift-rs v1.0.7 [INFO] [stderr] Downloaded agents-proc-macros v0.3.0 [INFO] [stderr] Downloaded evals-macros v0.3.0 [INFO] [stderr] Downloaded worker-macros v0.7.5 [INFO] [stderr] Downloaded worker-sys v0.7.5 [INFO] [stderr] Downloaded worker v0.7.5 [INFO] [stderr] Downloaded agents v0.3.0 [INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-4-tc1/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-4-tc1/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+562dee4820c458d823175268e41601d4c060588a" "metadata" "--no-deps" "--format-version=1", kill_on_drop: false }` [INFO] [stdout] 93429b58377a989617970bc32ee61e549ce2945c2053be9742f5f7c15c3ca9c8 [INFO] running `Command { std: "docker" "start" "-a" "93429b58377a989617970bc32ee61e549ce2945c2053be9742f5f7c15c3ca9c8", kill_on_drop: false }` [INFO] running `Command { std: "docker" "inspect" "93429b58377a989617970bc32ee61e549ce2945c2053be9742f5f7c15c3ca9c8", kill_on_drop: false }` [INFO] running `Command { std: "docker" "rm" "-f" "93429b58377a989617970bc32ee61e549ce2945c2053be9742f5f7c15c3ca9c8", kill_on_drop: false }` [INFO] [stdout] 93429b58377a989617970bc32ee61e549ce2945c2053be9742f5f7c15c3ca9c8 [INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-4-tc1/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-4-tc1/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_INCREMENTAL=0" "-e" "RUST_BACKTRACE=full" "-e" "RUSTFLAGS=--cap-lints=forbid" "-e" "RUSTDOCFLAGS=--cap-lints=forbid" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+562dee4820c458d823175268e41601d4c060588a" "build" "--frozen" "--message-format=json", kill_on_drop: false }` [INFO] [stdout] 27697b81d77a801e02d073de094a7e35083a6e4eafff07698598e5f8c6e7837d [INFO] running `Command { std: "docker" "start" "-a" "27697b81d77a801e02d073de094a7e35083a6e4eafff07698598e5f8c6e7837d", kill_on_drop: false }` [INFO] [stderr] Compiling proc-macro2 v1.0.106 [INFO] [stderr] Compiling unicode-ident v1.0.24 [INFO] [stderr] Compiling libc v0.2.183 [INFO] [stderr] Compiling serde_core v1.0.228 [INFO] [stderr] Compiling itoa v1.0.17 [INFO] [stderr] Compiling memchr v2.8.0 [INFO] [stderr] Compiling quote v1.0.45 [INFO] [stderr] Compiling log v0.4.29 [INFO] [stderr] Compiling pin-project-lite v0.2.17 [INFO] [stderr] Compiling smallvec v1.15.1 [INFO] [stderr] Compiling zmij v1.0.21 [INFO] [stderr] Compiling bytes v1.11.1 [INFO] [stderr] Compiling futures-core v0.3.32 [INFO] [stderr] Compiling once_cell v1.21.4 [INFO] [stderr] Compiling find-msvc-tools v0.1.9 [INFO] [stderr] Compiling cc v1.2.57 [INFO] [stderr] Compiling ryu v1.0.23 [INFO] [stderr] Compiling syn v2.0.117 [INFO] [stderr] Compiling http v1.4.0 [INFO] [stderr] Compiling crunchy v0.2.4 [INFO] [stderr] Compiling thiserror v2.0.18 [INFO] [stderr] Compiling zeroize v1.8.2 [INFO] [stderr] Compiling rustls-pki-types v1.14.0 [INFO] [stderr] Compiling tracing-core v0.1.36 [INFO] [stderr] Compiling errno v0.3.14 [INFO] [stderr] Compiling mio v1.1.1 [INFO] [stderr] Compiling signal-hook-registry v1.4.8 [INFO] [stderr] Compiling socket2 v0.6.3 [INFO] [stderr] Compiling http-body v1.0.1 [INFO] [stderr] Compiling getrandom v0.2.17 [INFO] [stderr] Compiling percent-encoding v2.3.2 [INFO] [stderr] Compiling futures-task v0.3.32 [INFO] [stderr] Compiling serde_json v1.0.149 [INFO] [stderr] Compiling serde v1.0.228 [INFO] [stderr] Compiling slab v0.4.12 [INFO] [stderr] Compiling foldhash v0.2.0 [INFO] [stderr] Compiling ring v0.17.14 [INFO] [stderr] Compiling hashbrown v0.16.1 [INFO] [stderr] Compiling bitflags v2.11.0 [INFO] [stderr] Compiling rustls v0.23.37 [INFO] [stderr] Compiling getrandom v0.4.2 [INFO] [stderr] Compiling rustix v1.1.4 [INFO] [stderr] Compiling itertools v0.14.0 [INFO] [stderr] Compiling tiny-keccak v2.0.2 [INFO] [stderr] Compiling castaway v0.2.4 [INFO] [stderr] Compiling futures-channel v0.3.32 [INFO] [stderr] Compiling indoc v2.0.7 [INFO] [stderr] Compiling linux-raw-sys v0.12.1 [INFO] [stderr] Compiling parking_lot_core v0.9.12 [INFO] [stderr] Compiling typeid v1.0.3 [INFO] [stderr] Compiling instability v0.3.12 [INFO] [stderr] Compiling unicode-width v0.2.2 [INFO] [stderr] Compiling unicase v2.9.0 [INFO] [stderr] Compiling const-random-macro v0.1.16 [INFO] [stderr] Compiling mime_guess v2.0.5 [INFO] [stderr] Compiling pest v2.8.6 [INFO] [stderr] Compiling compact_str v0.9.0 [INFO] [stderr] Compiling lru v0.16.3 [INFO] [stderr] Compiling unicode-truncate v2.0.1 [INFO] [stderr] Compiling form_urlencoded v1.2.2 [INFO] [stderr] Compiling convert_case v0.10.0 [INFO] [stderr] Compiling sync_wrapper v1.0.2 [INFO] [stderr] Compiling ipnet v2.12.0 [INFO] [stderr] Compiling lock_api v0.4.14 [INFO] [stderr] Compiling deranged v0.5.8 [INFO] [stderr] Compiling synstructure v0.13.2 [INFO] [stderr] Compiling darling_core v0.23.0 [INFO] [stderr] Compiling darling_core v0.20.11 [INFO] [stderr] Compiling pest_meta v2.8.6 [INFO] [stderr] Compiling const-random v0.1.18 [INFO] [stderr] Compiling signal-hook v0.3.18 [INFO] [stderr] Compiling webpki-roots v1.0.6 [INFO] [stderr] Compiling rustls-webpki v0.103.9 [INFO] [stderr] Compiling num-conv v0.2.0 [INFO] [stderr] Compiling litrs v1.0.0 [INFO] [stderr] Compiling base64 v0.21.7 [INFO] [stderr] Compiling time-core v0.1.8 [INFO] [stderr] Compiling zerofrom-derive v0.1.6 [INFO] [stderr] Compiling serde_derive v1.0.228 [INFO] [stderr] Compiling yoke-derive v0.8.1 [INFO] [stderr] Compiling zerovec-derive v0.11.2 [INFO] [stderr] Compiling displaydoc v0.2.5 [INFO] [stderr] Compiling tokio-macros v2.6.1 [INFO] [stderr] Compiling futures-macro v0.3.32 [INFO] [stderr] Compiling thiserror-impl v2.0.18 [INFO] [stderr] Compiling tracing-attributes v0.1.31 [INFO] [stderr] Compiling zerofrom v0.1.6 [INFO] [stderr] Compiling yoke v0.8.1 [INFO] [stderr] Compiling zerotrie v0.2.3 [INFO] [stderr] Compiling zerovec v0.11.5 [INFO] [stderr] Compiling tokio v1.50.0 [INFO] [stderr] Compiling darling_macro v0.23.0 [INFO] [stderr] Compiling futures-util v0.3.32 [INFO] [stderr] Compiling strum_macros v0.27.2 [INFO] [stderr] Compiling darling v0.23.0 [INFO] [stderr] Compiling darling_macro v0.20.11 [INFO] [stderr] Compiling tinystr v0.8.2 [INFO] [stderr] Compiling potential_utf v0.1.4 [INFO] [stderr] Compiling icu_locale_core v2.1.1 [INFO] [stderr] Compiling icu_collections v2.1.1 [INFO] [stderr] Compiling tracing v0.1.44 [INFO] [stderr] Compiling kasuari v0.4.12 [INFO] [stderr] Compiling darling v0.20.11 [INFO] [stderr] Compiling icu_provider v2.1.1 [INFO] [stderr] Compiling icu_normalizer v2.1.1 [INFO] [stderr] Compiling icu_properties v2.1.2 [INFO] [stderr] Compiling derive_more-impl v2.1.1 [INFO] [stderr] Compiling futures-sink v0.3.32 [INFO] [stderr] Compiling strum v0.27.2 [INFO] [stderr] Compiling iri-string v0.7.10 [INFO] [stderr] Compiling ratatui-core v0.1.0 [INFO] [stderr] Compiling num_threads v0.1.7 [INFO] [stderr] Compiling regex-syntax v0.8.10 [INFO] [stderr] Compiling erased-serde v0.4.10 [INFO] [stderr] Compiling swift-rs v1.0.7 [INFO] [stderr] Compiling idna_adapter v1.2.1 [INFO] [stderr] Compiling idna v1.1.0 [INFO] [stderr] Compiling time v0.3.47 [INFO] [stderr] Compiling url v2.5.8 [INFO] [stderr] Compiling regex-automata v0.4.14 [INFO] [stderr] Compiling nom v7.1.3 [INFO] [stderr] Compiling serde_urlencoded v0.7.1 [INFO] [stderr] Compiling hashbrown v0.15.5 [INFO] [stderr] Compiling derive_more v2.1.1 [INFO] [stderr] Compiling derive_builder_core v0.20.2 [INFO] [stderr] Compiling document-features v0.2.12 [INFO] [stderr] Compiling hyper v1.8.1 [INFO] [stderr] Compiling tokio-rustls v0.26.4 [INFO] [stderr] Compiling tower v0.5.3 [INFO] [stderr] Compiling tokio-util v0.7.18 [INFO] [stderr] Compiling ref-cast-impl v1.0.25 [INFO] [stderr] Compiling thiserror-impl v1.0.69 [INFO] [stderr] Compiling tower-http v0.6.8 [INFO] [stderr] Compiling signal-hook-mio v0.2.5 [INFO] [stderr] Compiling pest_generator v2.8.6 [INFO] [stderr] Compiling dlv-list v0.5.2 [INFO] [stderr] Compiling hyper-util v0.1.20 [INFO] [stderr] Compiling parking_lot v0.12.5 [INFO] [stderr] Compiling serde_derive_internals v0.29.1 [INFO] [stderr] Compiling uuid v1.22.0 [INFO] [stderr] Compiling line-clipping v0.3.5 [INFO] [stderr] Compiling http-body-util v0.1.3 [INFO] [stderr] Compiling winnow v1.0.0 [INFO] [stderr] Compiling ordered-multimap v0.7.3 [INFO] [stderr] Compiling cfb v0.7.3 [INFO] [stderr] Compiling ratatui-widgets v0.3.0 [INFO] [stderr] Compiling hyper-rustls v0.27.7 [INFO] [stderr] Compiling schemars_derive v1.2.1 [INFO] [stderr] Compiling pest_derive v2.8.6 [INFO] [stderr] Compiling thiserror v1.0.69 [INFO] [stderr] Compiling crossterm v0.29.0 [INFO] [stderr] Compiling reqwest v0.12.28 [INFO] [stderr] Compiling ref-cast v1.0.25 [INFO] [stderr] Compiling matchers v0.2.0 [INFO] [stderr] Compiling derive_builder_macro v0.20.2 [INFO] [stderr] Compiling eventsource-stream v0.2.3 [INFO] [stderr] Compiling hashlink v0.10.0 [INFO] [stderr] Compiling toml_parser v1.0.10+spec-1.1.0 [INFO] [stderr] Compiling agents v0.3.0 [INFO] [stderr] Compiling async-trait v0.1.89 [INFO] [stderr] Compiling serde_spanned v1.0.4 [INFO] [stderr] Compiling toml_datetime v1.0.1+spec-1.1.0 [INFO] [stderr] Compiling tracing-log v0.2.0 [INFO] [stderr] Compiling encoding_rs v0.8.35 [INFO] [stderr] Compiling anyhow v1.0.102 [INFO] [stderr] Compiling fastrand v2.3.0 [INFO] [stderr] Compiling tempfile v3.27.0 [INFO] [stderr] Compiling prettyplease v0.2.37 [INFO] [stderr] Compiling tracing-subscriber v0.3.23 [INFO] [stderr] Compiling reqwest-eventsource v0.6.0 [INFO] [stderr] Compiling toml v1.0.7+spec-1.1.0 [INFO] [stderr] Compiling serde-untagged v0.1.9 [INFO] [stderr] Compiling schemars v1.2.1 [INFO] [stderr] Compiling yaml-rust2 v0.10.4 [INFO] [stderr] Compiling ratatui-crossterm v0.1.0 [INFO] [stderr] Compiling ratatui-macros v0.7.0 [INFO] [stderr] Compiling derive_builder v0.20.2 [INFO] [stderr] Compiling json5 v0.4.1 [INFO] [stderr] Compiling infer v0.19.0 [INFO] [stderr] Compiling rust-ini v0.21.3 [INFO] [stderr] Compiling ron v0.12.0 [INFO] [stderr] Compiling agents-proc-macros v0.3.0 [INFO] [stderr] Compiling humantime v2.3.0 [INFO] [stderr] Compiling same-file v1.0.6 [INFO] [stderr] Compiling walkdir v2.5.0 [INFO] [stderr] Compiling evals-macros v0.3.0 [INFO] [stderr] Compiling chrono v0.4.44 [INFO] [stderr] Compiling ratatui v0.30.0 [INFO] [stderr] Compiling config v0.15.22 [INFO] [stderr] Compiling evals v0.3.0 (/opt/rustwide/workdir) [INFO] [stderr] Finished `dev` profile [unoptimized + debuginfo] target(s) in 3m 23s [INFO] running `Command { std: "docker" "inspect" "27697b81d77a801e02d073de094a7e35083a6e4eafff07698598e5f8c6e7837d", kill_on_drop: false }` [INFO] running `Command { std: "docker" "rm" "-f" "27697b81d77a801e02d073de094a7e35083a6e4eafff07698598e5f8c6e7837d", kill_on_drop: false }` [INFO] [stdout] 27697b81d77a801e02d073de094a7e35083a6e4eafff07698598e5f8c6e7837d [INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-4-tc1/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-4-tc1/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_INCREMENTAL=0" "-e" "RUST_BACKTRACE=full" "-e" "RUSTFLAGS=--cap-lints=forbid" "-e" "RUSTDOCFLAGS=--cap-lints=forbid" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+562dee4820c458d823175268e41601d4c060588a" "test" "--frozen" "--no-run" "--message-format=json", kill_on_drop: false }` [INFO] [stdout] a842b9b7ebc5a459196c0a8b08d7e22173f1f66a8cc1fe410dadaad174f982e5 [INFO] running `Command { std: "docker" "start" "-a" "a842b9b7ebc5a459196c0a8b08d7e22173f1f66a8cc1fe410dadaad174f982e5", kill_on_drop: false }` [INFO] [stderr] Compiling evals v0.3.0 (/opt/rustwide/workdir) [INFO] [stderr] Finished `test` profile [unoptimized + debuginfo] target(s) in 21.86s [INFO] running `Command { std: "docker" "inspect" "a842b9b7ebc5a459196c0a8b08d7e22173f1f66a8cc1fe410dadaad174f982e5", kill_on_drop: false }` [INFO] running `Command { std: "docker" "rm" "-f" "a842b9b7ebc5a459196c0a8b08d7e22173f1f66a8cc1fe410dadaad174f982e5", kill_on_drop: false }` [INFO] [stdout] a842b9b7ebc5a459196c0a8b08d7e22173f1f66a8cc1fe410dadaad174f982e5 [INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-4-tc1/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-4-tc1/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_INCREMENTAL=0" "-e" "RUST_BACKTRACE=full" "-e" "RUSTFLAGS=--cap-lints=forbid" "-e" "RUSTDOCFLAGS=--cap-lints=forbid" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+562dee4820c458d823175268e41601d4c060588a" "test" "--frozen", kill_on_drop: false }` [INFO] [stdout] d8423fdfcbaca7afdd7a8441a60e638d0457fc558365c4ffb08ab3079ac12d20 [INFO] running `Command { std: "docker" "start" "-a" "d8423fdfcbaca7afdd7a8441a60e638d0457fc558365c4ffb08ab3079ac12d20", kill_on_drop: false }` [INFO] [stderr] Finished `test` profile [unoptimized + debuginfo] target(s) in 0.51s [INFO] [stderr] Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/evals-d4a74a0593235290) [INFO] [stdout] [INFO] [stdout] running 34 tests [INFO] [stdout] test tests::display_label_returns_configured_label ... ok [INFO] [stdout] test runner::config::tests::loads_human_timeout_from_evals_toml ... ok [INFO] [stdout] test runner::config::tests::rejects_timeout_and_timeout_secs_together ... ok [INFO] [stdout] test runner::config::tests::loads_workers_ai_provider_config_from_evals_toml ... ok [INFO] [stdout] test tests::eval_tags_builder_extends_tags_in_order ... ok [INFO] [stdout] test runner::tests::init_workspace_writes_example_config ... ok [INFO] [stdout] test tests::agent_factory_receives_target_scoped_llm_runner_in_context ... ok [INFO] [stdout] test runner::tests::init_workspace_requires_force_to_overwrite ... ok [INFO] [stdout] test tests::grader_means_count_failed_trials_as_zero_score ... ok [INFO] [stdout] test tests::judge_uses_context_runner_and_returns_normal_grade ... ok [INFO] [stdout] test runner::config::tests::loads_provider_ollama_url_from_evals_toml ... ok [INFO] [stdout] test tests::failed_trials_can_persist_partial_agent_output ... ok [INFO] [stdout] test tests::report_writer_persists_expected_files ... ok [INFO] [stdout] test tests::runtime_failures_are_preserved_in_persisted_transcript ... ok [INFO] [stdout] test tests::suite_runner_expands_model_matrix ... ok [INFO] [stdout] test tests::suite_records_failed_trials_without_aborting_run ... ok [INFO] [stdout] test tests::eval_specific_timeout_overrides_run_timeout ... ok [INFO] [stdout] test tests::suite_runner_plan_errors_when_no_model_matches ... ok [INFO] [stdout] test tests::suite_runner_plan_errors_when_no_query_matches ... ok [INFO] [stdout] test tests::suite_runner_plan_keeps_full_suite_when_query_matches_suite_id ... ok [INFO] [stdout] test tests::suite_runner_plan_filters_evals_and_targets_before_execution ... ok [INFO] [stdout] test tests::suite_runs_trials_and_aggregates_scores ... ok [INFO] [stdout] test tests::failed_trajectory_grading_is_preserved_in_persisted_transcript ... ok [INFO] [stdout] test tests::summary_table_groups_rows_by_eval ... ok [INFO] [stdout] test tests::trajectory_expectations_appear_in_trial_and_summary_grades ... ok [INFO] [stdout] test tests::trajectory_macro_builds_linear_steps ... ok [INFO] [stdout] test tests::persisted_trials_include_materialized_context_windows ... ok [INFO] [stdout] test tests::trajectory_trials_record_step_inputs_and_grading_events ... ok [INFO] [stdout] test tests::suite_runner_records_structured_timeout_errors ... ok [INFO] [stdout] test tests::usage_is_aggregated_once_per_provider_response ... ok [INFO] [stdout] test tests::trial_records_capture_timing_and_summary_averages ... ok [INFO] [stdout] test tests::hosted_targets_run_trials_concurrently_within_limit ... ok [INFO] [stdout] test tests::persisted_runs_flush_trial_records_before_completion ... ok [INFO] [stdout] test tests::hosted_targets_overlap_with_local_targets_while_local_targets_serialize ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 34 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.26s [INFO] [stdout] [INFO] [stderr] Doc-tests evals [INFO] [stdout] [INFO] [stdout] running 6 tests [INFO] [stdout] test src/grade.rs - grade::predicate (line 280) - compile ... ok [INFO] [stdout] test src/lib.rs - assistant (line 89) - compile ... ok [INFO] [stdout] test src/lib.rs - (line 12) - compile ... ok [INFO] [stdout] test src/judge.rs - judge::judge (line 123) ... ok [INFO] [stdout] test src/lib.rs - trajectory (line 117) ... ok [INFO] [stdout] test src/lib.rs - user (line 72) ... ok [INFO] [stdout] [INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.06s [INFO] [stdout] [INFO] [stdout] all doctests ran in 3.20s; merged doctests compilation took 3.09s [INFO] running `Command { std: "docker" "inspect" "d8423fdfcbaca7afdd7a8441a60e638d0457fc558365c4ffb08ab3079ac12d20", kill_on_drop: false }` [INFO] running `Command { std: "docker" "rm" "-f" "d8423fdfcbaca7afdd7a8441a60e638d0457fc558365c4ffb08ab3079ac12d20", kill_on_drop: false }` [INFO] [stdout] d8423fdfcbaca7afdd7a8441a60e638d0457fc558365c4ffb08ab3079ac12d20