Re: Big mistake
phgphd   9/16/2013 3:15:36 PM
Your point is well taken.  Thank you for your honesty.

When to do complete regressions
John E   9/16/2013 11:07:34 AM
I make this decision on almost every project my team works on.  Our policy is to do a full test pass on the final FW and HW revisions before we release a product.  The reality is that we are often pressed for time and must make a risk assessment on the "small change" that the FW team made at the last minute.  We often choose to do a focused regression pass rather than the full test sweep, but we do it knowing we are taking a risk, and we make sure that the project team agrees.  We build industrial products, not medical or military, so the risk is more contained.

Re: Big mistake
BrainiacV   9/16/2013 11:03:37 AM
I agree with you. The best advice I've ever gotten has come from people telling me their horror stories.  Many a time when stuck on a problem, I'd reflect on some similar problem I had been told about, and it would lead to a solution.

One time, just before I left for a customer's site across the country to install some software I had written, my manager told me about a case in which he had found that the hardware engineer had addressed all the interface cards to the same location.

Later, at the job site, my program was working, but every once in a while the system would mysteriously crash.  I had had to develop the software without a computer that ran the OS and without access to the hardware itself.  Since I only had the spec sheets, I wrote the program very conservatively.  It was interrupt driven, and I didn't know how long it would take after an interrupt was received for the background process to get control and deal with the data, so I placed everything in ring buffers.
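For readers unfamiliar with the technique, here is a minimal sketch of a ring buffer sitting between an interrupt handler and a background task. It is an illustration only, not the original program: the class name, the capacity, and the overrun counter are all assumptions, and a real interrupt-driven version would also need to guard against the handler and the background task touching the indices at the same time.

```python
class RingBuffer:
    """Fixed-size FIFO: the interrupt side writes, the background side reads.

    Hypothetical illustration; names and behavior are assumptions.
    """

    def __init__(self, capacity):
        self._slots = [None] * capacity
        self._capacity = capacity
        self._head = 0      # index of the next slot to write
        self._tail = 0      # index of the next slot to read
        self._count = 0     # number of unread items
        self.overruns = 0   # items dropped because the buffer was full

    def put(self, item):
        """Store one item; drop it and count an overrun if the buffer is full."""
        if self._count == self._capacity:
            self.overruns += 1
            return False
        self._slots[self._head] = item
        self._head = (self._head + 1) % self._capacity
        self._count += 1
        return True

    def get(self):
        """Return the oldest unread item, or None if the buffer is empty."""
        if self._count == 0:
            return None
        item = self._slots[self._tail]
        self._tail = (self._tail + 1) % self._capacity
        self._count -= 1
        return item
```

Because unread items stay in the buffer until the background task consumes them, the buffer also doubles as a short history of what the hardware reported, which is how unexpected status values can be spotted after a crash.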

After the program crashed again, I checked the buffers and found status values that should not have been there.  Remembering the problem my manager had, I chased down the hardware engineer and had him check the interface board addresses.  They had several modems in addition to the serial interface I was talking to.  It turned out one of the modems had the same address.  When I'd query the hardware, more often than not, a modem would respond instead of the serial interface.
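A check for this kind of conflict can be sketched in a few lines, assuming the board addresses are available as a simple list. The card names and addresses below are made up for illustration and are not taken from the system described.

```python
from collections import defaultdict


def find_address_conflicts(cards):
    """Return {address: [card names]} for every address claimed by more than one card."""
    by_address = defaultdict(list)
    for name, address in cards:
        by_address[address].append(name)
    return {addr: names for addr, names in by_address.items() if len(names) > 1}


# Hypothetical configuration: one modem shares the serial interface's address.
cards = [
    ("serial0", 0x3F8),
    ("modem0", 0x2F8),
    ("modem1", 0x3F8),
]

conflicts = find_address_conflicts(cards)
for address, names in conflicts.items():
    print(f"address {address:#x} is claimed by: {', '.join(names)}")
```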

What was crashing the system was that eventually an interrupt would be missed and the system would hang waiting for a return that would never come.

I doubt that it would have occurred to me to have the addresses checked if my manager hadn't just happened to mention a similar problem he had solved.

Bad missile
Battar   9/16/2013 9:14:37 AM
A friend of mine worked on missile development somewhere in the world. When they did a small navigational software update, they re-ran all the tests with the missile attached to the aircraft wing, on the ground. When a customer fired one of these missiles in a war zone (can't go into details), it went ballistic, hit something it shouldn't have, and caused a huge diplomatic fuss. Investigation showed that with a strong crosswind, the navigation loop didn't complete before it was interrupted by the next sensor update, and the missile effectively got lost. An air test would probably have shown up the fault. The moral is, as NASA always says, test as you fly, and fly as you test.
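The failure mode described here, a control loop that has not finished when the next sensor update arrives, can be illustrated with a small timing check. This is only a sketch: the sensor period and the loop times are invented numbers, not data from the incident.

```python
def find_overruns(loop_times_ms, sensor_period_ms):
    """Return the indices of loop iterations that were not done before the next update."""
    return [i for i, t in enumerate(loop_times_ms) if t >= sensor_period_ms]


# Hypothetical timings, in milliseconds.
SENSOR_PERIOD_MS = 10.0
ground_run = [6.1, 6.3, 6.0, 6.2]        # benign conditions on the ground
crosswind_run = [6.2, 9.4, 10.8, 11.5]   # heavier computation under load

print(find_overruns(ground_run, SENSOR_PERIOD_MS))     # [] : the ground test passes
print(find_overruns(crosswind_run, SENSOR_PERIOD_MS))  # [2, 3] : overruns appear under load
```

The same check passes on the ground data and fails on the loaded data, which is the point of testing under the conditions the system will actually see.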

regression test
vimalkumarp   9/15/2013 2:16:51 AM
In a mission-critical, life-critical application, regression testing must never be bypassed.

Big mistake
armorris   9/13/2013 2:17:29 PM
I don't know why I didn't send the technician back to test the entire circuit card. I guess I just thought that the simple timer circuit could not have affected other circuits. I got really chewed out for that mistake. Needless to say, I never made that mistake again. That was fairly early on in my career and the first time I had ever redesigned a circuit in a piece of flight hardware.

I'm sure that there are plenty of engineers who have made mistakes that they will never tell anybody about. Telling this story can't affect my career as I'm retired, but it might prevent some young engineer from making the same mistake.

Re: Surprised
TJ McDermott   9/13/2013 11:46:48 AM
Regression testing is necessary for everything.  Rockwell Automation routinely delays firmware releases because of regression test failures.  One can grumble about the delay (and I do), but I am also very appreciative that Rockwell DOES do exhaustive regression testing.

Semiconductors Too
tekochip   9/13/2013 10:49:17 AM
When I was with a semiconductor company, we used to see similar issues as the components went through a die revision.  The new part would meet all the specifications of the old part, but somewhere down the line there was a circuit using an unspecified feature of the old part, and suddenly that circuit wouldn't work any longer.
The worst-case example is a customer I had who used the part above the absolute specified supply rating.  He tested the parts and burned them in, and was happy accepting a small failure rate because it saved him buying a regulator on his board.  A new die revision no longer allowed him to be as frugal.

naperlou   9/13/2013 9:11:37 AM
Andrew, I am really surprised at the testing done on your change.  I worked on spacecraft and other military systems.  In my experience, any change would require a complete regression test, especially in a weapon system or man-critical system.
