Questions tagged [monitoring]
The monitoring tag has no usage guidance.
31 questions
1 vote, 2 answers, 90 views
CDN monitoring for cost distribution [closed]
I'm aggregating a lot of data from different vendors and have to provide a unified data set to a set of clients. The clients will be identified with a specific identifier upon fetching the data from ...
-2 votes, 2 answers, 170 views
Monitor newly released code without real traffic
As new code is released to production, I lack sufficient direct access to production execution environments, so I rely only on user traffic for monitoring. However, not all use cases get covered via user ...
0 votes, 1 answer, 134 views
AWS and cron job: improve logging, monitoring and concurrence
In my company, we use a separate cron server to run mission-critical background jobs on a single EC2 instance: the whole platform is therefore vulnerable to anything going wrong on this instance.
...
1 vote, 0 answers, 135 views
Centralized management of many single-tenant instances
I'm designing a SaaS application that will be used by multiple business customers (tenants). I decided to use the single-tenant model - a single instance of the application and database is deployed for each ...
3 votes, 1 answer, 386 views
Defining SLI / SLO for ETL and Reporting Application
All,
We've just started on our SRE journey and are trying to define SLIs / SLOs for our application.
It is an ETL application where 1. feeds (e.g. start of day, end of day data feeds) come from various ...
-2 votes, 1 answer, 150 views
What approaches are standard for 'healthcheck' monitoring of frontends?
A problem I'm currently solving is that at a glance I want to see the status of various services, including both frontends and backends, across dev, test and prod environments.
That is, I want to see:
...
2 votes, 2 answers, 130 views
Design a function that indicates significant deviations in response times
I need some feedback on how to approach the design of a function that highlights parts of a time series chart. The chart shows the response time of an application, in particular the 90th percentile, ...
-1 votes, 1 answer, 467 views
How to objectively monitor REST APIs performance based on the request/response size
We recently integrated a monitoring solution (Datadog) for our REST API-based microservices. The monitoring solution gives us many useful metrics. For example, we can see the average response time ...
0 votes, 1 answer, 655 views
How to add APM/observability/tracing in a monolithic architecture?
I keep reading articles analyzing Monitoring and Observability, with lots and lots of text on how the latter is an extension of the former, or how they are complementary, or how tracing is ...
-4 votes, 1 answer, 152 views
Best tools and practices for monitoring a request's round-trip time in production microservices?
I have a microservice system in production which provides an API to the outside world. Along with monitoring each service via a monitoring tool (Prometheus, Zabbix, etc.), it's of course highly desirable ...
-1 votes, 1 answer, 188 views
Show a service's status to the user
Some design thoughts on an application that is mostly event-driven, using Kubernetes and Docker.
The application is a web-based, single-page application. It shows updates from the back-end such ...
2 votes, 0 answers, 143 views
What is the most efficient way to find filesystem changes in percentage?
I am planning to write a program that will produce digest logs for file changes in a particular directory. The idea is that whenever a file is deleted in a particular directory, it will show up in the ...
1 vote, 3 answers, 84 views
Monitoring write-intensive web service through submitting synthesized data?
We have a web service that accepts images and metadata uploaded by end users, and the uploaded images go through multiple steps of processing and review with humans involved.
To monitor the ...
0 votes, 0 answers, 53 views
Do logs enhance availability of service for a well monitored application?
I used to work for teams that built software-as-a-service applications. Our requirements regarding production were often the same:
A complex service (web application, database, daemons, typically) ...
1 vote, 0 answers, 49 views
How should propagation of service abuse information work within a microservice architecture?
At the moment, I'm working on a product that's being broken down from a monolith to a bunch of microservices, and it seems to be going well enough.
However, if a user is abusing the service somehow, ...