Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Flex - Removed required field false positive for stop_id + added Foreign key violation for location_groups_id #1834

Merged

Conversation

jcpitre
Copy link
Contributor

@jcpitre jcpitre commented Sep 10, 2024

Closes #1749

Summary:

foreign_key_violation is triggered when location_group_id is defined in stop_times.txt but not in location_groups.txt.

  • Added a foreign key annotation to location_group_id in stopTimeSchema.

Check that missing_required_field no longer triggers when there's no stop_id in stop_times.txt AND a present id in locations.geojson OR location_group_id in location_groups.txt.

  • Made the stop_id conditionally required.
  • Added missing fields in stoptimeSchema.
  • Added a validator that checks that no more than one of stop_id, location_goup_id and or stop_id is present
  • the presence in locations.geojson is out of scope for this PR.

Also verified that feature Zone-based Demand Responsive Transit is present in the list of features for a dataset with Flex. We could not test this in #1776 until the @required was removed from stopId.

Expected behavior:

Most of missing required fields related to stopId should disappear from reports.
Some other errors or warning that were masked by the missing required field notice should appear. Some of these will have to be corrected other issues. e.g. #1755

Please make sure these boxes are checked before submitting your pull request - thanks!

  • Run the unit tests with gradle test to make sure you didn't break anything
  • Add or update any needed documentation to the repo
  • Format the title like "feat: [new feature short description]". Title must follow the Conventional Commit Specification(https://www.conventionalcommits.org/en/v1.0.0/).
  • Linked all relevant issues
  • Include screenshot(s) showing how this pull request works and fixes the issue(s)

@jcpitre jcpitre linked an issue Sep 10, 2024 that may be closed by this pull request
Copy link
Contributor

📝 Acceptance Test Report

📋 Summary

❌ The rule acceptance test has failed for commit 76011f5
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (18 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_trip_edge
us-chicago-brown-county-heartland-express-gtfs-2037 missing_trip_edge
us-chicago-the-otter-express-gtfs-2038 missing_trip_edge
us-colorado-all-points-transit-gtfs-2039 missing_trip_edge
us-colorado-avon-transit-gtfs-2040 missing_trip_edge
us-colorado-baca-area-transportation-gtfs-1087 missing_trip_edge
us-colorado-cripple-creek-transportation-gtfs-2042 missing_trip_edge
us-colorado-discover-goodwill-gtfs-2043 missing_trip_edge
us-colorado-dolores-county-gtfs-1141 missing_trip_edge
us-colorado-easy-ride-transportation-gtfs-1195 missing_trip_edge
us-colorado-eco-transit-gtfs-2045 missing_trip_edge
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_trip_edge
us-colorado-envida-gtfs-2044 missing_trip_edge
us-colorado-gunnison-valley-rta-gtfs-2048 missing_trip_edge
us-colorado-pueblo-transit-gtfs-1232 missing_trip_edge
us-colorado-steamboat-springs-transit-gtfs-2051 missing_trip_edge
us-colorado-town-of-telluride-gtfs-2050 missing_trip_edge
us-colorado-tsc-transit-gtfs-2052 missing_trip_edge
Dropped Errors (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_required_field
us-chicago-brown-county-heartland-express-gtfs-2037 missing_required_field
us-chicago-the-otter-express-gtfs-2038 missing_required_field
us-colorado-all-points-transit-gtfs-2039 missing_required_field
us-colorado-avon-transit-gtfs-2040 missing_required_field
us-colorado-baca-area-transportation-gtfs-1087 missing_required_field
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 missing_required_field
us-colorado-cripple-creek-transportation-gtfs-2042 missing_required_field
us-colorado-discover-goodwill-gtfs-2043 missing_required_field
us-colorado-dolores-county-gtfs-1141 missing_required_field
us-colorado-easy-ride-transportation-gtfs-1195 missing_required_field
us-colorado-eco-transit-gtfs-2045 missing_required_field
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_required_field
us-colorado-envida-gtfs-2044 missing_required_field
us-colorado-estes-transit-gtfs-2047 missing_required_field
us-colorado-gunnison-valley-rta-gtfs-2048 missing_required_field
us-colorado-pueblo-transit-gtfs-1232 missing_required_field
us-colorado-steamboat-springs-transit-gtfs-2051 missing_required_field
us-colorado-town-of-telluride-gtfs-2050 missing_required_field
us-colorado-tsc-transit-gtfs-2052 missing_required_field
gb-unknown-uk-aggregate-feed-gtfs-2014 trip_distance_exceeds_shape_distance
New Warnings (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
us-chicago-brown-county-heartland-express-gtfs-2037 fast_travel_between_far_stops
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 fast_travel_between_far_stops
us-colorado-estes-transit-gtfs-2047 fast_travel_between_far_stops
us-chicago-brown-county-heartland-express-gtfs-2037 stop_too_far_from_shape
us-colorado-all-points-transit-gtfs-2039 stop_too_far_from_shape
us-colorado-baca-area-transportation-gtfs-1087 stop_too_far_from_shape
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 stop_too_far_from_shape
us-colorado-estes-transit-gtfs-2047 stop_too_far_from_shape
us-colorado-tsc-transit-gtfs-2052 stop_too_far_from_shape
de-unknown-postauto-ag-gtfs-2053 stop_without_stop_time
pl-podkarpackie-komunikacja-miejska-przemysl-gtfs-1000 stop_without_stop_time
us-chicago-brown-county-heartland-express-gtfs-2037 stop_without_stop_time
us-chicago-the-otter-express-gtfs-2038 stop_without_stop_time
us-colorado-all-points-transit-gtfs-2039 stop_without_stop_time
us-colorado-avon-transit-gtfs-2040 stop_without_stop_time
us-colorado-baca-area-transportation-gtfs-1087 stop_without_stop_time
us-colorado-cripple-creek-transportation-gtfs-2042 stop_without_stop_time
us-colorado-discover-goodwill-gtfs-2043 stop_without_stop_time
us-colorado-dolores-county-gtfs-1141 stop_without_stop_time
us-colorado-easy-ride-transportation-gtfs-1195 stop_without_stop_time
us-colorado-eco-transit-gtfs-2045 stop_without_stop_time
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 stop_without_stop_time
us-colorado-envida-gtfs-2044 stop_without_stop_time
us-colorado-estes-transit-gtfs-2047 stop_without_stop_time
us-colorado-gunnison-valley-rta-gtfs-2048 stop_without_stop_time
us-colorado-pueblo-transit-gtfs-1232 stop_without_stop_time
us-colorado-steamboat-springs-transit-gtfs-2051 stop_without_stop_time
us-colorado-town-of-telluride-gtfs-2050 stop_without_stop_time
us-colorado-envida-gtfs-2044 trip_distance_exceeds_shape_distance_below_threshold
us-colorado-steamboat-springs-transit-gtfs-2051 trip_distance_exceeds_shape_distance_below_threshold
Dropped Warnings (1 out of 1575 datasets, ~0%) ✅

Details of new errors due to code change, which is less than the provided threshold of 1%.

Dataset Notice Code
gb-unknown-uk-aggregate-feed-gtfs-2014 fast_travel_between_consecutive_stops
gb-unknown-uk-aggregate-feed-gtfs-2014 fast_travel_between_far_stops
gb-unknown-uk-aggregate-feed-gtfs-2014 missing_bike_allowance
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_has_too_many_matches_for_shape
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_too_far_from_shape
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_too_far_from_shape_using_user_distance
gb-unknown-uk-aggregate-feed-gtfs-2014 stops_match_shape_out_of_order
gb-unknown-uk-aggregate-feed-gtfs-2014 trip_distance_exceeds_shape_distance_below_threshold

🛡️ Corruption Check

0 out of 1575 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 3.88 4.39 ⬆️+0.51
Median -- 1.40 1.49 ⬆️+0.09
Standard Deviation -- 11.01 22.21 ⬆️+11.20
Minimum in References Reports us-florida-citrus-county-transit-gtfs-630 0.51 0.56 ⬆️+0.05
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 315.62 825.19 ⬆️+509.57
Minimum in Latest Reports ph-unknown-hm-transport-inc-and-robinsons-malls-gtfs-1105 0.58 0.51 ⬇️-0.07
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 315.62 825.19 ⬆️+509.57

@jcpitre jcpitre marked this pull request as draft September 10, 2024 12:26
@jcpitre jcpitre marked this pull request as ready for review September 10, 2024 15:52
Copy link
Contributor

📝 Acceptance Test Report

📋 Summary

❌ The rule acceptance test has failed for commit 247f277
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (18 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_trip_edge
us-chicago-brown-county-heartland-express-gtfs-2037 missing_trip_edge
us-chicago-the-otter-express-gtfs-2038 missing_trip_edge
us-colorado-all-points-transit-gtfs-2039 missing_trip_edge
us-colorado-avon-transit-gtfs-2040 missing_trip_edge
us-colorado-baca-area-transportation-gtfs-1087 missing_trip_edge
us-colorado-cripple-creek-transportation-gtfs-2042 missing_trip_edge
us-colorado-discover-goodwill-gtfs-2043 missing_trip_edge
us-colorado-dolores-county-gtfs-1141 missing_trip_edge
us-colorado-easy-ride-transportation-gtfs-1195 missing_trip_edge
us-colorado-eco-transit-gtfs-2045 missing_trip_edge
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_trip_edge
us-colorado-envida-gtfs-2044 missing_trip_edge
us-colorado-gunnison-valley-rta-gtfs-2048 missing_trip_edge
us-colorado-pueblo-transit-gtfs-1232 missing_trip_edge
us-colorado-steamboat-springs-transit-gtfs-2051 missing_trip_edge
us-colorado-town-of-telluride-gtfs-2050 missing_trip_edge
us-colorado-tsc-transit-gtfs-2052 missing_trip_edge
Dropped Errors (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_required_field
us-chicago-brown-county-heartland-express-gtfs-2037 missing_required_field
us-chicago-the-otter-express-gtfs-2038 missing_required_field
us-colorado-all-points-transit-gtfs-2039 missing_required_field
us-colorado-avon-transit-gtfs-2040 missing_required_field
us-colorado-baca-area-transportation-gtfs-1087 missing_required_field
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 missing_required_field
us-colorado-cripple-creek-transportation-gtfs-2042 missing_required_field
us-colorado-discover-goodwill-gtfs-2043 missing_required_field
us-colorado-dolores-county-gtfs-1141 missing_required_field
us-colorado-easy-ride-transportation-gtfs-1195 missing_required_field
us-colorado-eco-transit-gtfs-2045 missing_required_field
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_required_field
us-colorado-envida-gtfs-2044 missing_required_field
us-colorado-estes-transit-gtfs-2047 missing_required_field
us-colorado-gunnison-valley-rta-gtfs-2048 missing_required_field
us-colorado-pueblo-transit-gtfs-1232 missing_required_field
us-colorado-steamboat-springs-transit-gtfs-2051 missing_required_field
us-colorado-town-of-telluride-gtfs-2050 missing_required_field
us-colorado-tsc-transit-gtfs-2052 missing_required_field
gb-unknown-uk-aggregate-feed-gtfs-2014 trip_distance_exceeds_shape_distance
New Warnings (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
us-chicago-brown-county-heartland-express-gtfs-2037 fast_travel_between_far_stops
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 fast_travel_between_far_stops
us-colorado-estes-transit-gtfs-2047 fast_travel_between_far_stops
us-chicago-brown-county-heartland-express-gtfs-2037 stop_too_far_from_shape
us-colorado-all-points-transit-gtfs-2039 stop_too_far_from_shape
us-colorado-baca-area-transportation-gtfs-1087 stop_too_far_from_shape
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 stop_too_far_from_shape
us-colorado-estes-transit-gtfs-2047 stop_too_far_from_shape
us-colorado-tsc-transit-gtfs-2052 stop_too_far_from_shape
de-unknown-postauto-ag-gtfs-2053 stop_without_stop_time
pl-podkarpackie-komunikacja-miejska-przemysl-gtfs-1000 stop_without_stop_time
us-chicago-brown-county-heartland-express-gtfs-2037 stop_without_stop_time
us-chicago-the-otter-express-gtfs-2038 stop_without_stop_time
us-colorado-all-points-transit-gtfs-2039 stop_without_stop_time
us-colorado-avon-transit-gtfs-2040 stop_without_stop_time
us-colorado-baca-area-transportation-gtfs-1087 stop_without_stop_time
us-colorado-cripple-creek-transportation-gtfs-2042 stop_without_stop_time
us-colorado-discover-goodwill-gtfs-2043 stop_without_stop_time
us-colorado-dolores-county-gtfs-1141 stop_without_stop_time
us-colorado-easy-ride-transportation-gtfs-1195 stop_without_stop_time
us-colorado-eco-transit-gtfs-2045 stop_without_stop_time
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 stop_without_stop_time
us-colorado-envida-gtfs-2044 stop_without_stop_time
us-colorado-estes-transit-gtfs-2047 stop_without_stop_time
us-colorado-gunnison-valley-rta-gtfs-2048 stop_without_stop_time
us-colorado-pueblo-transit-gtfs-1232 stop_without_stop_time
us-colorado-steamboat-springs-transit-gtfs-2051 stop_without_stop_time
us-colorado-town-of-telluride-gtfs-2050 stop_without_stop_time
us-colorado-envida-gtfs-2044 trip_distance_exceeds_shape_distance_below_threshold
us-colorado-steamboat-springs-transit-gtfs-2051 trip_distance_exceeds_shape_distance_below_threshold
Dropped Warnings (1 out of 1575 datasets, ~0%) ✅

Details of new errors due to code change, which is less than the provided threshold of 1%.

Dataset Notice Code
gb-unknown-uk-aggregate-feed-gtfs-2014 fast_travel_between_consecutive_stops
gb-unknown-uk-aggregate-feed-gtfs-2014 fast_travel_between_far_stops
gb-unknown-uk-aggregate-feed-gtfs-2014 missing_bike_allowance
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_has_too_many_matches_for_shape
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_too_far_from_shape
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_too_far_from_shape_using_user_distance
gb-unknown-uk-aggregate-feed-gtfs-2014 stops_match_shape_out_of_order
gb-unknown-uk-aggregate-feed-gtfs-2014 trip_distance_exceeds_shape_distance_below_threshold

🛡️ Corruption Check

0 out of 1575 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 3.91 4.58 ⬆️+0.66
Median -- 1.41 1.49 ⬆️+0.07
Standard Deviation -- 10.94 21.73 ⬆️+10.79
Minimum in References Reports ph-unknown-hm-transport-inc-and-robinsons-malls-gtfs-1105 0.50 0.50 ⬆️+0.00
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 308.86 747.53 ⬆️+438.67
Minimum in Latest Reports ph-unknown-hm-transport-inc-and-robinsons-malls-gtfs-1105 0.50 0.50 ⬆️+0.00
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 308.86 747.53 ⬆️+438.67

@@ -0,0 +1,52 @@
/*
* Copyright 2020 Google LLC
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[picky]:

Suggested change
* Copyright 2020 Google LLC
* Copyright 2024 MobilityData

Comment on lines 76 to 80
@ConditionallyRequired
GtfsTime startPickupDropOffWindow();

@ConditionallyRequired
GtfsTime endPickupDropOffWindow();
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is there any validator implementing the conditional required logic for these fields? If not, please remove the annotation and bring it back when the logic is implemented.

@jcpitre
Copy link
Contributor Author

jcpitre commented Sep 10, 2024

About the faliing check

  • New errors:

  • Dropped errors:

    • missing_required_field: corrected by this PR
    • trip_distance_exceeds_shape_distance: Not sure why it dropped TBD
  • New warnings:

    • TBD. One problem with the stop_id required field false positive fixed is that stop_times.txt is not considered unparseable any more. This could make some of the warnings appear because stop_times.txt is now parseable.

Copy link
Contributor

📝 Acceptance Test Report

📋 Summary

❌ The rule acceptance test has failed for commit 9551526
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (18 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_trip_edge
us-chicago-brown-county-heartland-express-gtfs-2037 missing_trip_edge
us-chicago-the-otter-express-gtfs-2038 missing_trip_edge
us-colorado-all-points-transit-gtfs-2039 missing_trip_edge
us-colorado-avon-transit-gtfs-2040 missing_trip_edge
us-colorado-baca-area-transportation-gtfs-1087 missing_trip_edge
us-colorado-cripple-creek-transportation-gtfs-2042 missing_trip_edge
us-colorado-discover-goodwill-gtfs-2043 missing_trip_edge
us-colorado-dolores-county-gtfs-1141 missing_trip_edge
us-colorado-easy-ride-transportation-gtfs-1195 missing_trip_edge
us-colorado-eco-transit-gtfs-2045 missing_trip_edge
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_trip_edge
us-colorado-envida-gtfs-2044 missing_trip_edge
us-colorado-gunnison-valley-rta-gtfs-2048 missing_trip_edge
us-colorado-pueblo-transit-gtfs-1232 missing_trip_edge
us-colorado-steamboat-springs-transit-gtfs-2051 missing_trip_edge
us-colorado-town-of-telluride-gtfs-2050 missing_trip_edge
us-colorado-tsc-transit-gtfs-2052 missing_trip_edge
Dropped Errors (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_required_field
us-chicago-brown-county-heartland-express-gtfs-2037 missing_required_field
us-chicago-the-otter-express-gtfs-2038 missing_required_field
us-colorado-all-points-transit-gtfs-2039 missing_required_field
us-colorado-avon-transit-gtfs-2040 missing_required_field
us-colorado-baca-area-transportation-gtfs-1087 missing_required_field
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 missing_required_field
us-colorado-cripple-creek-transportation-gtfs-2042 missing_required_field
us-colorado-discover-goodwill-gtfs-2043 missing_required_field
us-colorado-dolores-county-gtfs-1141 missing_required_field
us-colorado-easy-ride-transportation-gtfs-1195 missing_required_field
us-colorado-eco-transit-gtfs-2045 missing_required_field
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_required_field
us-colorado-envida-gtfs-2044 missing_required_field
us-colorado-estes-transit-gtfs-2047 missing_required_field
us-colorado-gunnison-valley-rta-gtfs-2048 missing_required_field
us-colorado-pueblo-transit-gtfs-1232 missing_required_field
us-colorado-steamboat-springs-transit-gtfs-2051 missing_required_field
us-colorado-town-of-telluride-gtfs-2050 missing_required_field
us-colorado-tsc-transit-gtfs-2052 missing_required_field
gb-unknown-uk-aggregate-feed-gtfs-2014 trip_distance_exceeds_shape_distance
New Warnings (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
us-chicago-brown-county-heartland-express-gtfs-2037 fast_travel_between_far_stops
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 fast_travel_between_far_stops
us-colorado-estes-transit-gtfs-2047 fast_travel_between_far_stops
us-chicago-brown-county-heartland-express-gtfs-2037 stop_too_far_from_shape
us-colorado-all-points-transit-gtfs-2039 stop_too_far_from_shape
us-colorado-baca-area-transportation-gtfs-1087 stop_too_far_from_shape
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 stop_too_far_from_shape
us-colorado-estes-transit-gtfs-2047 stop_too_far_from_shape
us-colorado-tsc-transit-gtfs-2052 stop_too_far_from_shape
de-unknown-postauto-ag-gtfs-2053 stop_without_stop_time
pl-podkarpackie-komunikacja-miejska-przemysl-gtfs-1000 stop_without_stop_time
us-chicago-brown-county-heartland-express-gtfs-2037 stop_without_stop_time
us-chicago-the-otter-express-gtfs-2038 stop_without_stop_time
us-colorado-all-points-transit-gtfs-2039 stop_without_stop_time
us-colorado-avon-transit-gtfs-2040 stop_without_stop_time
us-colorado-baca-area-transportation-gtfs-1087 stop_without_stop_time
us-colorado-cripple-creek-transportation-gtfs-2042 stop_without_stop_time
us-colorado-discover-goodwill-gtfs-2043 stop_without_stop_time
us-colorado-dolores-county-gtfs-1141 stop_without_stop_time
us-colorado-easy-ride-transportation-gtfs-1195 stop_without_stop_time
us-colorado-eco-transit-gtfs-2045 stop_without_stop_time
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 stop_without_stop_time
us-colorado-envida-gtfs-2044 stop_without_stop_time
us-colorado-estes-transit-gtfs-2047 stop_without_stop_time
us-colorado-gunnison-valley-rta-gtfs-2048 stop_without_stop_time
us-colorado-pueblo-transit-gtfs-1232 stop_without_stop_time
us-colorado-steamboat-springs-transit-gtfs-2051 stop_without_stop_time
us-colorado-town-of-telluride-gtfs-2050 stop_without_stop_time
us-colorado-envida-gtfs-2044 trip_distance_exceeds_shape_distance_below_threshold
us-colorado-steamboat-springs-transit-gtfs-2051 trip_distance_exceeds_shape_distance_below_threshold
Dropped Warnings (1 out of 1575 datasets, ~0%) ✅

Details of new errors due to code change, which is less than the provided threshold of 1%.

Dataset Notice Code
gb-unknown-uk-aggregate-feed-gtfs-2014 fast_travel_between_consecutive_stops
gb-unknown-uk-aggregate-feed-gtfs-2014 fast_travel_between_far_stops
gb-unknown-uk-aggregate-feed-gtfs-2014 missing_bike_allowance
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_has_too_many_matches_for_shape
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_too_far_from_shape
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_too_far_from_shape_using_user_distance
gb-unknown-uk-aggregate-feed-gtfs-2014 stops_match_shape_out_of_order
gb-unknown-uk-aggregate-feed-gtfs-2014 trip_distance_exceeds_shape_distance_below_threshold

🛡️ Corruption Check

0 out of 1575 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 3.94 4.48 ⬆️+0.54
Median -- 1.40 1.48 ⬆️+0.08
Standard Deviation -- 11.21 23.89 ⬆️+12.68
Minimum in References Reports ph-unknown-hm-transport-inc-and-robinsons-malls-gtfs-1105 0.49 0.55 ⬆️+0.06
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 322.53 893.96 ⬆️+571.43
Minimum in Latest Reports us-california-city-of-wasco-gtfs-1788 0.60 0.52 ⬇️-0.07
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 322.53 893.96 ⬆️+571.43

@emmambd
Copy link
Contributor

emmambd commented Sep 10, 2024

@jcpitre These changes look to be expected and positive! Learnings from this:

  • I've added Flex: Update missing_trip_edge rule to create exception  #1755 back into the 6.0 release since this is a false positive we'll need to correct beforehand. Outside the scope of this PR.

  • In general, I'm wondering about gb-unknown-uk-aggregate-feed-gtfs-2014 in the performance tests. It taking 7 extra minutes to run the validation report seems surprising...maybe not given all the additional fields added to stop_times.txt.

  • Before these acceptance tests were run, I noticed only 1 feed in our analytics dashboard that had the missing_required_field notice because of stop_id and was unrelated to Flex data: https://mobilitydatabase.org/feeds/mdb-1000. It looks like it's a partial GTFS file (doesn't include all required files). This is a feed I would've expected to be affected by the new notice since stop_id is no longer included in missing_required_field. What am I missing?

@emmambd
Copy link
Contributor

emmambd commented Sep 10, 2024

@jcpitre I also just want to clarify the merging order of these PRs.

It looks like this can be merged independently on #1805 and #1810? Should adding the checks in the new validator that evaluates the unique id for geography inlocations.geojson be a new issue or included in #1810?

@jcpitre
Copy link
Contributor Author

jcpitre commented Sep 10, 2024

@jcpitre I also just want to clarify the merging order of these PRs.

It looks like this can be merged independently on #1805 and #1810? Should adding the checks in the new validator that evaluates the unique id for geography inlocations.geojson be a new issue or included in #1810?

Do you mean this: GtfsGeojsonFeatureUniqueLocationIdValidator.java
I don't see why it should have its own issue. It's kind of done in #1810 as a demonstration. Also now that location_groups.txt has been added I can just modify the validator to take it into account.

@emmambd
Copy link
Contributor

emmambd commented Sep 10, 2024

@jcpitre Sounds good! So it sounds like the validator in #1810 will be modified to include location_groups.txt info once this PR is merged

@jcpitre
Copy link
Contributor Author

jcpitre commented Sep 10, 2024

  • In general, I'm wondering about gb-unknown-uk-aggregate-feed-gtfs-2014 in the performance tests. I'm assuming its performance issues are unrelated to the coding changes, but taking 7 extra minutes to run the validation report seems surprising.

The stop_times in this datataset has nearly 42 million records. I added 5 new fields in the schema for that file. So at least for memory it's quite a chunk more, and it takes more time to load. The new validator has to go through these 42 million records. Does it amount to 7 minutes? It could I guess.

Let me correct that: it went from 322 to 893 seconds. It is indeed surprising that it almost tripled

Copy link
Contributor

📝 Acceptance Test Report

📋 Summary

❌ The rule acceptance test has failed for commit 77fa357
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (18 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_trip_edge
us-chicago-brown-county-heartland-express-gtfs-2037 missing_trip_edge
us-chicago-the-otter-express-gtfs-2038 missing_trip_edge
us-colorado-all-points-transit-gtfs-2039 missing_trip_edge
us-colorado-avon-transit-gtfs-2040 missing_trip_edge
us-colorado-baca-area-transportation-gtfs-1087 missing_trip_edge
us-colorado-cripple-creek-transportation-gtfs-2042 missing_trip_edge
us-colorado-discover-goodwill-gtfs-2043 missing_trip_edge
us-colorado-dolores-county-gtfs-1141 missing_trip_edge
us-colorado-easy-ride-transportation-gtfs-1195 missing_trip_edge
us-colorado-eco-transit-gtfs-2045 missing_trip_edge
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_trip_edge
us-colorado-envida-gtfs-2044 missing_trip_edge
us-colorado-gunnison-valley-rta-gtfs-2048 missing_trip_edge
us-colorado-pueblo-transit-gtfs-1232 missing_trip_edge
us-colorado-steamboat-springs-transit-gtfs-2051 missing_trip_edge
us-colorado-town-of-telluride-gtfs-2050 missing_trip_edge
us-colorado-tsc-transit-gtfs-2052 missing_trip_edge
Dropped Errors (22 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_required_field
pl-podkarpackie-komunikacja-miejska-przemysl-gtfs-1000 missing_required_field
us-chicago-brown-county-heartland-express-gtfs-2037 missing_required_field
us-chicago-the-otter-express-gtfs-2038 missing_required_field
us-colorado-all-points-transit-gtfs-2039 missing_required_field
us-colorado-avon-transit-gtfs-2040 missing_required_field
us-colorado-baca-area-transportation-gtfs-1087 missing_required_field
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 missing_required_field
us-colorado-cripple-creek-transportation-gtfs-2042 missing_required_field
us-colorado-discover-goodwill-gtfs-2043 missing_required_field
us-colorado-dolores-county-gtfs-1141 missing_required_field
us-colorado-easy-ride-transportation-gtfs-1195 missing_required_field
us-colorado-eco-transit-gtfs-2045 missing_required_field
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_required_field
us-colorado-envida-gtfs-2044 missing_required_field
us-colorado-estes-transit-gtfs-2047 missing_required_field
us-colorado-gunnison-valley-rta-gtfs-2048 missing_required_field
us-colorado-pueblo-transit-gtfs-1232 missing_required_field
us-colorado-steamboat-springs-transit-gtfs-2051 missing_required_field
us-colorado-town-of-telluride-gtfs-2050 missing_required_field
us-colorado-tsc-transit-gtfs-2052 missing_required_field
gb-unknown-uk-aggregate-feed-gtfs-2014 trip_distance_exceeds_shape_distance
New Warnings (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
us-chicago-brown-county-heartland-express-gtfs-2037 fast_travel_between_far_stops
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 fast_travel_between_far_stops
us-colorado-estes-transit-gtfs-2047 fast_travel_between_far_stops
us-chicago-brown-county-heartland-express-gtfs-2037 stop_too_far_from_shape
us-colorado-all-points-transit-gtfs-2039 stop_too_far_from_shape
us-colorado-baca-area-transportation-gtfs-1087 stop_too_far_from_shape
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 stop_too_far_from_shape
us-colorado-estes-transit-gtfs-2047 stop_too_far_from_shape
us-colorado-tsc-transit-gtfs-2052 stop_too_far_from_shape
de-unknown-postauto-ag-gtfs-2053 stop_without_stop_time
pl-podkarpackie-komunikacja-miejska-przemysl-gtfs-1000 stop_without_stop_time
us-chicago-brown-county-heartland-express-gtfs-2037 stop_without_stop_time
us-chicago-the-otter-express-gtfs-2038 stop_without_stop_time
us-colorado-all-points-transit-gtfs-2039 stop_without_stop_time
us-colorado-avon-transit-gtfs-2040 stop_without_stop_time
us-colorado-baca-area-transportation-gtfs-1087 stop_without_stop_time
us-colorado-cripple-creek-transportation-gtfs-2042 stop_without_stop_time
us-colorado-discover-goodwill-gtfs-2043 stop_without_stop_time
us-colorado-dolores-county-gtfs-1141 stop_without_stop_time
us-colorado-easy-ride-transportation-gtfs-1195 stop_without_stop_time
us-colorado-eco-transit-gtfs-2045 stop_without_stop_time
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 stop_without_stop_time
us-colorado-envida-gtfs-2044 stop_without_stop_time
us-colorado-estes-transit-gtfs-2047 stop_without_stop_time
us-colorado-gunnison-valley-rta-gtfs-2048 stop_without_stop_time
us-colorado-pueblo-transit-gtfs-1232 stop_without_stop_time
us-colorado-steamboat-springs-transit-gtfs-2051 stop_without_stop_time
us-colorado-town-of-telluride-gtfs-2050 stop_without_stop_time
us-colorado-envida-gtfs-2044 trip_distance_exceeds_shape_distance_below_threshold
us-colorado-steamboat-springs-transit-gtfs-2051 trip_distance_exceeds_shape_distance_below_threshold
Dropped Warnings (1 out of 1575 datasets, ~0%) ✅

Details of new errors due to code change, which is less than the provided threshold of 1%.

Dataset Notice Code
gb-unknown-uk-aggregate-feed-gtfs-2014 fast_travel_between_consecutive_stops
gb-unknown-uk-aggregate-feed-gtfs-2014 fast_travel_between_far_stops
gb-unknown-uk-aggregate-feed-gtfs-2014 missing_bike_allowance
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_has_too_many_matches_for_shape
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_too_far_from_shape
gb-unknown-uk-aggregate-feed-gtfs-2014 stop_too_far_from_shape_using_user_distance
gb-unknown-uk-aggregate-feed-gtfs-2014 stops_match_shape_out_of_order
gb-unknown-uk-aggregate-feed-gtfs-2014 trip_distance_exceeds_shape_distance_below_threshold

🛡️ Corruption Check

0 out of 1575 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 3.89 4.39 ⬆️+0.50
Median -- 1.38 1.44 ⬆️+0.06
Standard Deviation -- 11.03 22.88 ⬆️+11.85
Minimum in References Reports us-michigan-detroit-people-mover-gtfs-417 0.51 0.60 ⬆️+0.09
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 315.52 852.43 ⬆️+536.91
Minimum in Latest Reports us-california-catalina-express-gtfs-299 0.64 0.51 ⬇️-0.13
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 315.52 852.43 ⬆️+536.91

@jcpitre jcpitre added the do not merge This PR needs more work/discussion or is not meant to be merged label Sep 11, 2024
Copy link
Contributor

This contribution does not follow the conventions set by the Google Java style guide. Please run the following command line at the root of the project to fix formatting errors: ./gradlew goJF.

Copy link
Contributor

📝 Acceptance Test Report

📋 Summary

❌ The rule acceptance test has failed for commit f1239d4
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (18 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_trip_edge
us-chicago-brown-county-heartland-express-gtfs-2037 missing_trip_edge
us-chicago-the-otter-express-gtfs-2038 missing_trip_edge
us-colorado-all-points-transit-gtfs-2039 missing_trip_edge
us-colorado-avon-transit-gtfs-2040 missing_trip_edge
us-colorado-baca-area-transportation-gtfs-1087 missing_trip_edge
us-colorado-cripple-creek-transportation-gtfs-2042 missing_trip_edge
us-colorado-discover-goodwill-gtfs-2043 missing_trip_edge
us-colorado-dolores-county-gtfs-1141 missing_trip_edge
us-colorado-easy-ride-transportation-gtfs-1195 missing_trip_edge
us-colorado-eco-transit-gtfs-2045 missing_trip_edge
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_trip_edge
us-colorado-envida-gtfs-2044 missing_trip_edge
us-colorado-gunnison-valley-rta-gtfs-2048 missing_trip_edge
us-colorado-pueblo-transit-gtfs-1232 missing_trip_edge
us-colorado-steamboat-springs-transit-gtfs-2051 missing_trip_edge
us-colorado-town-of-telluride-gtfs-2050 missing_trip_edge
us-colorado-tsc-transit-gtfs-2052 missing_trip_edge
Dropped Errors (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_required_field
pl-podkarpackie-komunikacja-miejska-przemysl-gtfs-1000 missing_required_field
us-chicago-brown-county-heartland-express-gtfs-2037 missing_required_field
us-chicago-the-otter-express-gtfs-2038 missing_required_field
us-colorado-all-points-transit-gtfs-2039 missing_required_field
us-colorado-avon-transit-gtfs-2040 missing_required_field
us-colorado-baca-area-transportation-gtfs-1087 missing_required_field
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 missing_required_field
us-colorado-cripple-creek-transportation-gtfs-2042 missing_required_field
us-colorado-discover-goodwill-gtfs-2043 missing_required_field
us-colorado-dolores-county-gtfs-1141 missing_required_field
us-colorado-easy-ride-transportation-gtfs-1195 missing_required_field
us-colorado-eco-transit-gtfs-2045 missing_required_field
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_required_field
us-colorado-envida-gtfs-2044 missing_required_field
us-colorado-estes-transit-gtfs-2047 missing_required_field
us-colorado-gunnison-valley-rta-gtfs-2048 missing_required_field
us-colorado-pueblo-transit-gtfs-1232 missing_required_field
us-colorado-steamboat-springs-transit-gtfs-2051 missing_required_field
us-colorado-town-of-telluride-gtfs-2050 missing_required_field
us-colorado-tsc-transit-gtfs-2052 missing_required_field
New Warnings (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
us-chicago-brown-county-heartland-express-gtfs-2037 fast_travel_between_far_stops
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 fast_travel_between_far_stops
us-colorado-estes-transit-gtfs-2047 fast_travel_between_far_stops
us-chicago-brown-county-heartland-express-gtfs-2037 stop_too_far_from_shape
us-colorado-all-points-transit-gtfs-2039 stop_too_far_from_shape
us-colorado-baca-area-transportation-gtfs-1087 stop_too_far_from_shape
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 stop_too_far_from_shape
us-colorado-estes-transit-gtfs-2047 stop_too_far_from_shape
us-colorado-tsc-transit-gtfs-2052 stop_too_far_from_shape
de-unknown-postauto-ag-gtfs-2053 stop_without_stop_time
pl-podkarpackie-komunikacja-miejska-przemysl-gtfs-1000 stop_without_stop_time
us-chicago-brown-county-heartland-express-gtfs-2037 stop_without_stop_time
us-chicago-the-otter-express-gtfs-2038 stop_without_stop_time
us-colorado-all-points-transit-gtfs-2039 stop_without_stop_time
us-colorado-avon-transit-gtfs-2040 stop_without_stop_time
us-colorado-baca-area-transportation-gtfs-1087 stop_without_stop_time
us-colorado-cripple-creek-transportation-gtfs-2042 stop_without_stop_time
us-colorado-discover-goodwill-gtfs-2043 stop_without_stop_time
us-colorado-dolores-county-gtfs-1141 stop_without_stop_time
us-colorado-easy-ride-transportation-gtfs-1195 stop_without_stop_time
us-colorado-eco-transit-gtfs-2045 stop_without_stop_time
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 stop_without_stop_time
us-colorado-envida-gtfs-2044 stop_without_stop_time
us-colorado-estes-transit-gtfs-2047 stop_without_stop_time
us-colorado-gunnison-valley-rta-gtfs-2048 stop_without_stop_time
us-colorado-pueblo-transit-gtfs-1232 stop_without_stop_time
us-colorado-steamboat-springs-transit-gtfs-2051 stop_without_stop_time
us-colorado-town-of-telluride-gtfs-2050 stop_without_stop_time
us-colorado-envida-gtfs-2044 trip_distance_exceeds_shape_distance_below_threshold
us-colorado-steamboat-springs-transit-gtfs-2051 trip_distance_exceeds_shape_distance_below_threshold
Dropped Warnings (0 out of 1575 datasets, ~0%) ✅

No changes were detected due to the code change.

🛡️ Corruption Check

0 out of 1575 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 3.88 4.08 ⬆️+0.20
Median -- 1.40 1.48 ⬆️+0.08
Standard Deviation -- 10.47 11.00 ⬆️+0.53
Minimum in References Reports us-oregon-high-desert-point-gtfs-636 0.49 0.56 ⬆️+0.07
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 281.09 300.17 ⬆️+19.08
Minimum in Latest Reports us-oregon-high-desert-point-gtfs-636 0.49 0.56 ⬆️+0.07
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 281.09 300.17 ⬆️+19.08

Copy link
Contributor

📝 Acceptance Test Report

📋 Summary

✅ The rule acceptance has passed for commit d339411
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (0 out of 1 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Errors (0 out of 1 datasets, ~0%) ✅

No changes were detected due to the code change.

New Warnings (0 out of 1 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Warnings (0 out of 1 datasets, ~0%) ✅

No changes were detected due to the code change.

🛡️ Corruption Check

0 out of 1 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 293.24 296.83 ⬆️+3.59
Median -- 293.24 296.83 ⬆️+3.59
Standard Deviation -- 0.00 0.00 ⬇️0.00
Minimum in References Reports gb-unknown-uk-aggregate-feed-gtfs-2014 293.24 296.83 ⬆️+3.59
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 293.24 296.83 ⬆️+3.59
Minimum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 293.24 296.83 ⬆️+3.59
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 293.24 296.83 ⬆️+3.59

Copy link
Contributor

📝 Acceptance Test Report

📋 Summary

❌ The rule acceptance test has failed for commit 6618a15
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (18 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_trip_edge
us-chicago-brown-county-heartland-express-gtfs-2037 missing_trip_edge
us-chicago-the-otter-express-gtfs-2038 missing_trip_edge
us-colorado-all-points-transit-gtfs-2039 missing_trip_edge
us-colorado-avon-transit-gtfs-2040 missing_trip_edge
us-colorado-baca-area-transportation-gtfs-1087 missing_trip_edge
us-colorado-cripple-creek-transportation-gtfs-2042 missing_trip_edge
us-colorado-discover-goodwill-gtfs-2043 missing_trip_edge
us-colorado-dolores-county-gtfs-1141 missing_trip_edge
us-colorado-easy-ride-transportation-gtfs-1195 missing_trip_edge
us-colorado-eco-transit-gtfs-2045 missing_trip_edge
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_trip_edge
us-colorado-envida-gtfs-2044 missing_trip_edge
us-colorado-gunnison-valley-rta-gtfs-2048 missing_trip_edge
us-colorado-pueblo-transit-gtfs-1232 missing_trip_edge
us-colorado-steamboat-springs-transit-gtfs-2051 missing_trip_edge
us-colorado-town-of-telluride-gtfs-2050 missing_trip_edge
us-colorado-tsc-transit-gtfs-2052 missing_trip_edge
Dropped Errors (20 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
de-unknown-postauto-ag-gtfs-2053 missing_required_field
us-chicago-brown-county-heartland-express-gtfs-2037 missing_required_field
us-chicago-the-otter-express-gtfs-2038 missing_required_field
us-colorado-all-points-transit-gtfs-2039 missing_required_field
us-colorado-avon-transit-gtfs-2040 missing_required_field
us-colorado-baca-area-transportation-gtfs-1087 missing_required_field
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 missing_required_field
us-colorado-cripple-creek-transportation-gtfs-2042 missing_required_field
us-colorado-discover-goodwill-gtfs-2043 missing_required_field
us-colorado-dolores-county-gtfs-1141 missing_required_field
us-colorado-easy-ride-transportation-gtfs-1195 missing_required_field
us-colorado-eco-transit-gtfs-2045 missing_required_field
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 missing_required_field
us-colorado-envida-gtfs-2044 missing_required_field
us-colorado-estes-transit-gtfs-2047 missing_required_field
us-colorado-gunnison-valley-rta-gtfs-2048 missing_required_field
us-colorado-pueblo-transit-gtfs-1232 missing_required_field
us-colorado-steamboat-springs-transit-gtfs-2051 missing_required_field
us-colorado-town-of-telluride-gtfs-2050 missing_required_field
us-colorado-tsc-transit-gtfs-2052 missing_required_field
New Warnings (21 out of 1575 datasets, ~1%) ❌

Details of new errors due to code change, which is above the provided threshold of 1%.

Dataset Notice Code
us-chicago-brown-county-heartland-express-gtfs-2037 fast_travel_between_far_stops
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 fast_travel_between_far_stops
us-colorado-estes-transit-gtfs-2047 fast_travel_between_far_stops
us-chicago-brown-county-heartland-express-gtfs-2037 stop_too_far_from_shape
us-colorado-all-points-transit-gtfs-2039 stop_too_far_from_shape
us-colorado-baca-area-transportation-gtfs-1087 stop_too_far_from_shape
us-colorado-blackhawk-and-central-city-tramway-gtfs-2041 stop_too_far_from_shape
us-colorado-estes-transit-gtfs-2047 stop_too_far_from_shape
us-colorado-tsc-transit-gtfs-2052 stop_too_far_from_shape
de-unknown-postauto-ag-gtfs-2053 stop_without_stop_time
pl-podkarpackie-komunikacja-miejska-przemysl-gtfs-1000 stop_without_stop_time
us-chicago-brown-county-heartland-express-gtfs-2037 stop_without_stop_time
us-chicago-the-otter-express-gtfs-2038 stop_without_stop_time
us-colorado-all-points-transit-gtfs-2039 stop_without_stop_time
us-colorado-avon-transit-gtfs-2040 stop_without_stop_time
us-colorado-baca-area-transportation-gtfs-1087 stop_without_stop_time
us-colorado-cripple-creek-transportation-gtfs-2042 stop_without_stop_time
us-colorado-discover-goodwill-gtfs-2043 stop_without_stop_time
us-colorado-dolores-county-gtfs-1141 stop_without_stop_time
us-colorado-easy-ride-transportation-gtfs-1195 stop_without_stop_time
us-colorado-eco-transit-gtfs-2045 stop_without_stop_time
us-colorado-el-paso-fountain-valley-senior-citizens-program-inc-gtfs-2046 stop_without_stop_time
us-colorado-envida-gtfs-2044 stop_without_stop_time
us-colorado-estes-transit-gtfs-2047 stop_without_stop_time
us-colorado-gunnison-valley-rta-gtfs-2048 stop_without_stop_time
us-colorado-pueblo-transit-gtfs-1232 stop_without_stop_time
us-colorado-steamboat-springs-transit-gtfs-2051 stop_without_stop_time
us-colorado-town-of-telluride-gtfs-2050 stop_without_stop_time
us-colorado-envida-gtfs-2044 trip_distance_exceeds_shape_distance_below_threshold
us-colorado-steamboat-springs-transit-gtfs-2051 trip_distance_exceeds_shape_distance_below_threshold
Dropped Warnings (0 out of 1575 datasets, ~0%) ✅

No changes were detected due to the code change.

🛡️ Corruption Check

0 out of 1575 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 3.90 4.26 ⬆️+0.36
Median -- 1.40 1.47 ⬆️+0.06
Standard Deviation -- 10.51 14.17 ⬆️+3.66
Minimum in References Reports ph-unknown-hm-transport-inc-and-robinsons-malls-gtfs-1105 0.49 0.57 ⬆️+0.08
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 284.45 293.00 ⬆️+8.55
Minimum in Latest Reports us-oregon-hut-airport-shuttle-gtfs-635 0.60 0.53 ⬇️-0.08
Maximum in Latest Reports ie-unknown-bus-eireann-gtfs-941 13.71 367.06 ⬆️+353.36

@jcpitre
Copy link
Contributor Author

jcpitre commented Sep 12, 2024

With the new 12G heap the acceptance tests seem to run properly.

@jcpitre jcpitre removed the do not merge This PR needs more work/discussion or is not meant to be merged label Sep 12, 2024
@jcpitre jcpitre merged commit cdcf921 into master Sep 13, 2024
334 of 335 checks passed
@jcpitre jcpitre deleted the 1749-flex-add-location-group-schema-and-update-stop-time-schema branch September 13, 2024 18:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Flex: Add location group schema and update stop time schema
3 participants