monitoringalerting #37

wjro · 2025-05-15T19:23:40Z

wjro commented

2025-05-15 19:23:40 +00:00

WIP: Something here feels off to me, the config fields for which services are deployed through via the helm chart are not easily modifiable. Since harmony is opinionated I,m not sure how much of a problem that is. Do we want the end user to have the liberty/responsibility to chose which services are deployed by default or not

wjro added 2 commits 2025-05-15 19:23:41 +00:00

feat: added default monitoringStackScore implementation b4c6848433

chore: modified build config to be able to pass namespace to the config eb8a8a2e04

wjro added 1 commit 2025-05-15 19:31:32 +00:00

fix merge conflict f7d3da3ac9

johnride requested changes 2025-05-15 20:33:30 +00:00

johnride left a comment

Good improvement over the first version, still some work to do to have it really look good. Keep it up!

harmony/src/modules/monitoring/kube_prometheus.rs Outdated

						
				@ -34,1 +34,4 @@

				"#;

				    let mut values_overrides: HashMap<NonBlankString, String> = HashMap::new();

				    macro_rules! insert_flag {

johnride commented

2025-05-15 19:41:50 +00:00

Using a macro here is weird at the very least.

It makes code not easily readable insert_flag is way less obvious to me what it does than values_overrides.insert(...) would

Use a function if you want to make the NonBlankString boilerplate more readable. And if you want to write a macro that does something valuable you should validate the NonBlankString at compile time like we do with out ip! and mac_address! macros.

Using a macro here is weird at the very least. It makes code not easily readable `insert_flag` is way less obvious to me what it does than `values_overrides.insert(...)` would Use a function if you want to make the NonBlankString boilerplate more readable. And if you want to write a macro that does something valuable you should validate the NonBlankString at compile time like we do with out ip! and mac_address! macros.

harmony/src/modules/monitoring/kube_prometheus.rs Outdated

						
				@ -43,3 +75,3 @@

				        chart_version: None,

				        values_overrides: None,

				        values_overrides: Some(values_overrides),

				        values_yaml: Some(values.to_string()),

johnride commented

2025-05-15 19:54:34 +00:00

Why use values_overrides here when there already is a values_yaml right above? Would make it a lot more readable I think.

harmony/src/modules/monitoring/monitoring_alerting.rs Outdated

						
				@ -11,0 +12,4 @@

				use super::{config::KubePrometheusConfig, kube_prometheus::kube_prometheus_helm_chart_score};

				#[derive(Debug, Clone, Serialize)]

				pub enum AlertChannel {

johnride commented

2025-05-15 20:00:34 +00:00

Nice, but did you test smtp? If it's not working yet there should be a // TODO comment or something like that.

I would expect to see server, username, password or something like that here. Then there is a bajillion providers with no standards. Emails are not a simple topic. Maybe we should even only support Webhook for now.

Nice, but did you test smtp? If it's not working yet there should be a // TODO comment or something like that. I would expect to see server, username, password or something like that here. Then there is a bajillion providers with no standards. Emails are not a simple topic. Maybe we should even only support Webhook for now.

harmony/src/modules/monitoring/monitoring_alerting.rs

						
				@ -11,0 +19,4 @@

				    },

				    Smpt {

				        email_address: EmailAddress,

				        service_name: String,

johnride commented

2025-05-15 19:43:21 +00:00

What is service_name used for? As this is user facing, a bit of rust doc (with triple slashes ///) to describe how to use would be useful here.

johnride marked this conversation as resolved

harmony/src/modules/monitoring/monitoring_alerting.rs

						
				@ -16,2 +34,2 @@

				    pub monitoring_stack: HelmChartScore,

				    pub namespace: String,

				    pub alert_channel: Option<AlertChannel>,

				    pub monitoring_stack: Stack,

johnride commented

2025-05-15 19:47:52 +00:00

No need for this monitoring_stack: Stack field for now. This is a case of YAGNI. We support only one type and I don't see in the very short term a use case that would force us.

Ron Jeffries, a co-founder of XP, explained the philosophy: "Always implement things when you actually need them, never when you just foresee that you [will] need them."[8] John Carmack wrote "It is hard for less experienced developers to appreciate how rarely architecting for future requirements / applications turns out net-positive."[9]
https://en.wikipedia.org/wiki/You_aren%27t_gonna_need_it

Another note, I'm realising now something that slipped by me yesterday :

I think we should have two scores for monitoring : cluster monitoring and app monitoring. The one we want with LampScore is ApplicationMonitoringScore. It comes with quite a bit of boilerplate but at least it makes sense to deploy it anywhere. Then we also provide a ClusterMonitoringScore. Under the hood they could very well use the same Interpret and work together efficiently.

No need for this `monitoring_stack: Stack` field for now. This is a case of YAGNI. We support only one type and I don't see in the very short term a use case that would force us. > Ron Jeffries, a co-founder of XP, explained the philosophy: "Always implement things when you actually need them, never when you just foresee that you [will] need them."[8] John Carmack wrote "It is hard for less experienced developers to appreciate how rarely architecting for future requirements / applications turns out net-positive."[9] > https://en.wikipedia.org/wiki/You_aren%27t_gonna_need_it --- Another note, I'm realising now something that slipped by me yesterday : I think we should have two scores for monitoring : cluster monitoring and app monitoring. The one we want with LampScore is ApplicationMonitoringScore. It comes with quite a bit of boilerplate but at least it makes sense to deploy it anywhere. Then we also provide a ClusterMonitoringScore. Under the hood they could very well use the same Interpret and work together efficiently.

johnride marked this conversation as resolved

harmony/src/modules/monitoring/monitoring_alerting.rs Outdated

						
				@ -32,3 +47,1 @@

				        Self {

				            monitoring_stack: kube_prometheus_score(ns),

				            namespace: ns.to_string(),

				    fn match_alert_channel(&self, config: &mut KubePrometheusConfig) {

johnride commented

2025-05-15 20:12:02 +00:00

Poor function name. What's the use of this function?

Something like configure_alerts maybe?

Poor function name. What's the use of this function? Something like `configure_alerts` maybe?

harmony/src/modules/monitoring/monitoring_alerting.rs Outdated

						
				@ -35,0 +47,4 @@

				    fn match_alert_channel(&self, config: &mut KubePrometheusConfig) {

				        if let Some(alert_channel) = &self.alert_channel {

				            match alert_channel {

				                //opt1

johnride commented

2025-05-15 19:50:29 +00:00

Useless comments

wjro added 2 commits 2025-05-20 19:59:06 +00:00

feat: install discord alert manager helm chart when Discord is the chosen alerting channel e80752ea3f

feat: added monitoring stack example to lamp demo 7fc2b1ebfe

wjro added 1 commit 2025-05-20 20:05:46 +00:00

fix:merge confict 2b6d2e8606

johnride approved these changes 2025-05-21 16:34:57 +00:00

johnride left a comment

The name MonitoringAlertingStackScore bothered me all along but I couldn't figure out why. Now I did :

The word Stack.

It should be MonitoringAlertingScore or TelemetryScore or something like that.

Our goal with this is to provide a functionnality to the app developers. This functionnality is not a specific stack, but only the Monitoring and Alerting features.

And now that I think of it, maybe they should be separate : MonitoringScore and AlertingScore.

Both could depend on the same feature provider under the hood. Could be prometheus grafana, hyperdx, datadog, etc.

Let's merge this as-is for now but I think we have a nice opportunity for a great design here!

The name MonitoringAlertingStackScore bothered me all along but I couldn't figure out why. Now I did : The word Stack. It should be MonitoringAlertingScore or TelemetryScore or something like that. Our goal with this is to provide a functionnality to the app developers. This functionnality is not a specific stack, but only the Monitoring and Alerting features. And now that I think of it, maybe they should be separate : MonitoringScore and AlertingScore. Both could depend on the same feature provider under the hood. Could be prometheus grafana, hyperdx, datadog, etc. Let's merge this as-is for now but I think we have a nice opportunity for a great design here!

wjro merged commit 19bd47a545 into master

2025-05-21 17:32:26 +00:00

wjro referenced this issue from a commit

2025-05-21 17:32:28 +00:00

Merge pull request 'monitoringalerting' (#37) from monitoringalerting into master

Sign in to join this conversation.

No reviewers