parsing baseball files in Rust instead of Python for an 8x speedup!

Linked to by This Week in Rust! Also see the Hacker News discussion here.

Since I’ve been quickly becoming a Rustacean (Rust enthusiast) – see previous posts here – I decided to take a crack at rewriting the parser for my baseball win expectancy finder. It’s written in Python, and when I made it parse files in parallel (previous writeup) that sped up processing time by ~4x. I thought a Rust version would run even faster, although a lot of the work is in regular expressions, which I didn’t think would show a big difference between the two languages.

Here’s the PR for the change, and I guess less time is spent in regular expressions than I thought, because here’s a table of the time it takes to parse ~130,000 baseball games (all MLB games from 1957-2019):

| Implementation | Time (seconds) |
| --- | --- |
| Python, single core | 165 |
| Python, multicore | 35 |
| Rust, single core (commit e76de817) | 20 |
| Rust, multicore | 4 (!) |

Time to parse 130,000 baseball games on my desktop i5-8600K CPU (which has 6 cores)

So that’s about an 8x speedup in both the single core and multicore code! And the final product runs in 4-5 seconds, which is just unbelievably fast.

As always, I learned a lot about Rust along the way:

  • Rust makes me want to write more performant code. This shouldn’t be a surprise, because performance is one of Rust’s raisons d’être (it’s first on the “Why Rust?” list at rust-lang.org!), but every time I have to call collect() or clone(), I think about whether I really need to do it. It’s much different than when I’m writing code in Python!
  • That being said, I didn’t do anything major algorithmically to speed things up – the biggest thing was probably keeping the mapping of which runners end up at which base (RunnerDests in the Rust code) in an array instead of a HashMap. But I’m sure that I avoided a ton of copies/allocations along the way.
  • One of the nicest little ways to speed things up is the entry() method on HashMap. It gives you an Entry for the key whether or not it’s already in the map, and then you can call something like or_insert() to fill in a default value and get a mutable reference to it. This is nice because you only have to do the hash lookup once! (There’s a small sketch of this after the list.)
  • Still a huge fan of the ? operator, which makes it very simple to propagate errors up. This time I used the anyhow crate to make it easy to return string-based errors, although I didn’t really take advantage of its ability to attach context to errors as I got a bit lazy. Maybe next time! (There’s a quick example after the list, too.)
  • The Rust single core implementation in debug mode took 673 seconds, which is 33x slower than in release mode. This is my usual reminder to never benchmark anything in debug mode!
  • The nice thing about this project is that, other than writing tests for parsing tricky plays, you can just run the parser on all the games and see if the resulting stats files match. After implementing the report for win expectancy including the balls/strikes count, I was dismayed to see that one of the stats files was wrong – there was one game somewhere (out of 130,000!) that wasn’t using the balls/strikes count correctly. Of course, I could have narrowed it down by only running the parser on certain years and going from there, but luckily, after looking at the Python code more closely, I realized that it handled cases where the pitches were lowercase and the Rust code did not, which was easy to fix. I guess there’s one plate appearance somewhere that has lowercase pitches!
  • I did try some optimizations by using SmallVec (see commit dd6ed7a) to store small vectors on the stack instead of using heap allocations. It did seem to help a little bit – the single core runtime went from 20 seconds to 19 seconds, although I’m not 100% sure that’s significant. I also used smol_str (see commit 48b47dfe) to do the same thing for strings after verifying that most of the strings in the files were 22 characters or less, although again it didn’t show much of an improvement, if any. (There’s a snippet showing both after the list.)
  • I also went ahead and rewrote the script that the web app calls to look up the data in Rust. I’m still clearly slower at writing Rust code than Python code – it took me a little over an hour when I already had a working Python script to look at. I assume it also runs faster than the Python one but they’re both fast enough so I didn’t bother to benchmark it.
  • Like with the clue solver and population centers projects, I used Rayon for the multicore implementation, which worked pretty well. One complaint I have is that I had to create a new copy of each report for each file we process and then merge them all together at the end. Ideally I would create just one copy per thread, since each thread can safely update its own copy, and that would reduce the overhead of merging so many reports together. But I couldn’t find a way of doing this with Rayon, and I guess I can’t complain since it ended up so fast anyway! (There’s a sketch of the map-and-merge shape after this list.)
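
Here’s a tiny sketch of the entry() pattern (the play-counting map here is made up for illustration, not code from the actual reports):

    use std::collections::HashMap;

    // Count how many times each play type shows up, doing only one hash
    // lookup per play.
    fn count_plays(plays: &[&str]) -> HashMap<String, u32> {
        let mut counts: HashMap<String, u32> = HashMap::new();
        for play in plays {
            // entry() returns an Entry for the key; or_insert(0) fills in 0 if
            // it's missing and returns a mutable reference either way.
            *counts.entry(play.to_string()).or_insert(0) += 1;
        }
        counts
    }

    fn main() {
        let counts = count_plays(&["single", "strikeout", "single"]);
        assert_eq!(counts["single"], 2);
    }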
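
And a quick example of ? with anyhow (parse_inning is a made-up helper, not one from the parser):

    use anyhow::{anyhow, Result};

    // Hypothetical helper showing how ? and anyhow make it easy to bubble up
    // string-y errors.
    fn parse_inning(field: &str) -> Result<u8> {
        let inning: u8 = field
            .parse()
            .map_err(|_| anyhow!("couldn't parse inning from '{}'", field))?;
        if inning == 0 {
            return Err(anyhow!("inning must be at least 1"));
        }
        Ok(inning)
    }

    fn main() -> Result<()> {
        let inning = parse_inning("7")?;
        println!("inning {}", inning);
        Ok(())
    }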
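
The SmallVec and smol_str usage looks roughly like this (the types and the play string are just examples, not the actual fields from the parser):

    use smallvec::SmallVec;
    use smol_str::SmolStr;

    fn main() {
        // A SmallVec that keeps up to 4 elements inline on the stack and only
        // heap-allocates if it grows past that.
        let mut runners: SmallVec<[u8; 4]> = SmallVec::new();
        runners.push(1);
        runners.push(3);
        assert!(!runners.spilled()); // still on the stack

        // SmolStr stores strings of 22 bytes or less inline and is cheap to clone.
        let token = SmolStr::new("S8/G.1-2");
        assert_eq!(token.len(), 8);
    }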
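
Finally, the multicore shape is basically map-and-merge. Here’s a rough sketch with a made-up Stats struct and placeholder file names standing in for the real reports and game files:

    use rayon::prelude::*;

    // Stand-in for the real report structs.
    #[derive(Default)]
    struct Stats {
        games: u64,
    }

    impl Stats {
        fn merge(mut self, other: Stats) -> Stats {
            self.games += other.games;
            self
        }
    }

    fn parse_file(_path: &str) -> Stats {
        // The real code parses every game in the file here.
        Stats { games: 1 }
    }

    fn main() {
        let files = vec!["file1", "file2", "file3"]; // placeholder names
        // One Stats per file, merged together at the end.
        let totals = files
            .par_iter()
            .map(|&f| parse_file(f))
            .reduce(Stats::default, Stats::merge);
        println!("parsed {} games", totals.games);
    }

Rayon’s fold()/reduce() pair can also cut the number of intermediate accumulators down to roughly one per work chunk, which gets closer to (though not exactly) one per thread.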

One area where I ran into problems this time was traits. For some background, the Python code has a Report class which acts as an interface – a list of Reports is passed into the code that parses a file, and after each game a method is called so each report can accumulate whatever statistics it wants. There’s also a subclass of that called StatsReport which assumes that you’re writing the data out in a certain format to a file, so it’s even easier to write new reports.

Rust doesn’t have inheritance, but it does have traits which are kinda similar, so I optimistically made a Report trait and a StatsReport trait, and made StatsReport have a supertrait of Report, so anything that implements StatsReport also has to implement Report. It’s kinda the same thing! But unlike with real inheritance, StatsReport can’t provide implementations for methods on Report, which is kind of annoying. Not hard to work around, since you can just make the methods on the concrete struct call helper methods on StatsReport, but it does mean there’s more boilerplate needed for concrete structs.
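
To make the boilerplate concrete, here’s a stripped-down sketch of the shape; the method names and the WinExpectancyReport struct are invented for illustration, not the ones in the repo:

    // Sketch only: method names are made up.
    trait Report {
        fn process_game(&mut self, game_id: &str);
        fn done(&self);
    }

    // StatsReport can add its own required and default methods, but it can't
    // supply bodies for Report's methods.
    trait StatsReport: Report {
        fn output_file_name(&self) -> String;
        fn write_stats(&self) {
            println!("writing {}", self.output_file_name());
        }
    }

    struct WinExpectancyReport;

    impl StatsReport for WinExpectancyReport {
        fn output_file_name(&self) -> String {
            "winexpectancy.txt".to_string() // made-up file name
        }
    }

    // The boilerplate: every concrete struct still has to implement Report
    // itself, typically by forwarding to helpers on StatsReport.
    impl Report for WinExpectancyReport {
        fn process_game(&mut self, _game_id: &str) { /* accumulate stats */ }
        fn done(&self) {
            self.write_stats();
        }
    }

    fn main() {
        let mut report = WinExpectancyReport;
        report.process_game("hypothetical-game-id");
        report.done();
    }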

Another problem I ran into is that writing the types for the merge_into() method on Report is hard, since ideally it would take a parameter of the same type as the concrete type. To be fair, this is tricky in a lot of languages (although Python’s type hints are optional, so it’s easy there!). What I ended up doing was having the method take something of type Any, adding a method to every concrete implementation that did

    fn as_any_mut(&mut self) -> &mut dyn Any { self }

to convert a Report to something of type Any (??), then adding a line to the top of merge_into() like

        let other = other.downcast_mut::<Self>().unwrap();

which seems like more than should be necessary, but obviously I don’t fully understand what’s going on. (thanks Stack Overflow as usual!) I had some other problems with making Report require the Clone trait, so I gave up and added a constructor method to the Report trait.
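
Putting those pieces together, a self-contained version of the pattern looks roughly like this; CountReport and its games counter are invented for the example, since the real reports track a lot more state:

    use std::any::Any;

    trait Report {
        // Lets callers turn a &mut dyn Report into a &mut dyn Any.
        fn as_any_mut(&mut self) -> &mut dyn Any;
        // Takes dyn Any because a plain `other: &mut Self` parameter would make
        // the trait unusable as a trait object.
        fn merge_into(&self, other: &mut dyn Any);
    }

    struct CountReport {
        games: u64,
    }

    impl Report for CountReport {
        fn as_any_mut(&mut self) -> &mut dyn Any {
            self
        }

        fn merge_into(&self, other: &mut dyn Any) {
            // Downcast back to the concrete type; this panics if someone passes
            // a different kind of report.
            let other = other.downcast_mut::<Self>().unwrap();
            other.games += self.games;
        }
    }

    fn main() {
        let a = CountReport { games: 3 };
        let mut b = CountReport { games: 4 };
        a.merge_into(b.as_any_mut());
        assert_eq!(b.games, 7);
    }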

I’m thinking about trying out Rust and WebAssembly next when I have more spare time!

13 thoughts on “parsing baseball files in Rust instead of Python for an 8x speedup!”

  1. One way to solve the `Report` and `StatReport` problem is to reverse it: instead of having `StatReport` demand `Report`, it implements it. So then you have:

    ```rust
    trait Report {
        fn different_for_each_type(..);
        fn same_for_many_types(..);
    }

    trait StatReport {
        fn different_for_each_type_impl(..);
        fn another_method(..);
    }

    impl<T: StatReport> Report for T {
        fn different_for_each_type(..) {
            self.different_for_each_type_impl(..)
        }

        fn same_for_many_types(..) { .. }
    }
    ```

    1. Oh, that’s clever! I guess the downside is that the StatsReport trait then needs another version of every method that’s different for each type, but maybe that’s just because I’m used to thinking of things from an object-oriented language perspective. I’m going to try this when I have a chance, thanks!

      (and if this is the Amos who is @fasterthanlime, I love your writing; it’s one of the things that got me excited about Rust!)

      1. About the repeated methods: you can also have a third trait containing those methods, which is a supertrait of the other two. Then it would look like this:


        trait ReportCore {
            fn different_for_each_type(..);
        }

        trait Report: ReportCore {
            fn same_for_many_types(..);
        }

        trait StatReport: ReportCore {
            fn another_method(..);
        }

        impl<T: StatReport> Report for T {
            fn same_for_many_types(..) { .. }
        }

        I’m not sure if it’s worth the extra complexity, but it does have less repetition.

      2. Hah, I guess the Rust community is big enough for two Amos’s now 🙂

        This is interesting, I’ll have to try it. Thanks – I’m learning a lot!

  2. Apparently WP doesn’t support markdown 😔 (hopefully it supports emojis), and swallows > and < (“html tags”).
    The impl line should be
    impl<T: StatReport> Report for T

  3. Regarding the `merge_into` and `Clone` issues, they both stem from the fact that you’re doing dynamic dispatch. This limits your methods to object-safe ones, which means that `Self` can’t appear except as the receiver – which isn’t true for `clone` or for your “ideal” `merge_into` (which would look something like `fn merge_into(&self, other: &mut Self)` ). The question is, do you really need dynamic dispatch? Looking at your main, it seems you’re choosing a single report for most options, and two for one of them. If you only choose single options, then you can extract most of your main to a function which is generic over the specific report type, avoiding both dynamic dispatch problems. If you really need the double report, you can do something like


    struct DoubleReport<R1, R2>(R1, R2);

    impl<R1: Report, R2: Report> Report for DoubleReport<R1, R2> {
        fn report_method(&self, ..) {
            self.0.report_method(..);
            self.1.report_method(..);
        }
    }

    This isn’t very scalable to multiple reports, but might be the right solution here.

  4. Just a small heads up: in the Cargo.toml file in the GitHub repository, optimizations aren’t explicitly set. You might be able to speed up the code by adding the following:

    [profile.release]
    opt-level = 3
