Skip to main content

Questions tagged [monitoring]

The tag has no usage guidance.

1 vote
2 answers
90 views

CDN monitoring for cost distribution [closed]

I'm aggregating a lot of data from different vendors and have to provide an unified data set to a set of clients. The clients will be identified with a specific identifier upon fetching the data from ...
Banana's user avatar
  • 141
-2 votes
2 answers
170 views

Monitor newly released code without real traffic

As new code is released to production, without enough direct access to production execution environments I rely only on user traffic for monitoring. However not all use cases get covered via user ...
sam kirubaharan's user avatar
0 votes
1 answer
134 views

AWS and cron job: improve logging, monitoring and concurrence

In my company, we use a separate cron server to run mission-critical background jobs that run on a single ec2 instance: the whole platform is then vulnerable to anything going wrong on this instance. ...
friday-json's user avatar
1 vote
0 answers
135 views

Centralized management of many single-tenant instances

Designing a SaaS application that will be used by multiple business customers (tenants). Decided to use the single tenant model - a single instance of the application and database is deployed for each ...
Alex's user avatar
  • 11
3 votes
1 answer
386 views

Defining SLI / SLO for ETL and Reporting Application

All, We're just started on SRE journey and trying to define SLI / SLO for our application. It is an ETL application where 1. feeds (e.g. start of day, end of day data feeds) comes from various ...
Ravi Parekh's user avatar
-2 votes
1 answer
150 views

What approaches are standard for 'healthcheck' monitoring of frontends?

A problem I'm currently solving is that at a glance I want to see the status of various services, including both frontends and backends, across dev, test and prod environments. That is, I want to see: ...
dwjohnston's user avatar
  • 2,563
2 votes
2 answers
130 views

Design a function that indicates significant deviations in response times

I'd need some feedback on how to approach the design of a function that highlights parts of a time series chart. The chart shows the response time of an application, in particular the 90th percentile, ...
BMBM's user avatar
  • 337
-1 votes
1 answer
467 views

How to objectively monitor REST APIs performance based on the request/response size

We recently integrated a monitoring solution (Data Dog) for our REST-APIs based microservices. The monitoring solution gives us many useful metrics. For example, we can see the average response time ...
Hawk's user avatar
  • 487
0 votes
1 answer
655 views

How to add APM/observability/tracing in a monolithic architecture?

I keep reading articles analyzing Monitoring and Observability, or having lots and lots of text regarding how the latter is the extension of the first, or how they are complementary, or how tracing is ...
Astyanax's user avatar
  • 324
-4 votes
1 answer
152 views

A request's round trip time monitoring at microservice production best tools and practices?

I've a microservice's system at production which provide an API to the outer world. Along with each service monitoring via monitoring tool ( Prometheus, zabbix, etc.) of course it's highly desirable ...
cpu's user avatar
  • 1
-1 votes
1 answer
188 views

Show a service's status to the user

Some design thoughts on an application that is mostly event driven using kubernetes and docker. The application is a web based application, a single page one. It shows updates from the back-end such ...
ring bearer's user avatar
2 votes
0 answers
143 views

What is the most efficient way to find filesystem changes in percentage?

I am planning to write a program that will produce digest logs for file changes in a particular directory. The idea is that whenever a file is deleted in a particular directory, it will show up in the ...
Michael John's user avatar
1 vote
3 answers
84 views

Monitoring write-intensive web service through submitting synthesized data?

We have a web service that accepts images and metadata uploaded by end users, and the uploaded images would go through multiple steps of processing / reviewing with human involved. To monitor the ...
liuyu's user avatar
  • 113
0 votes
0 answers
53 views

Do logs enhance availability of service for a well monitored application?

I used to work for teams that built software as a service applications. Our requirements, regarding production, were often the same : A complex service (web application, database, daemons, typically) ...
Diane M's user avatar
  • 2,076
1 vote
0 answers
49 views

How should propagation of service abuse information work within a microservice architecture?

At the moment, I'm working on a product that's being broken down from a monolith to a bunch of microservices, and it seems to be going well enough. However, if a user is abusing the service somehow, ...
Parthian Shot's user avatar

15 30 50 per page